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ABSTRACT 



Since overhead costs constitute a large percentage of 
total cost for aerospace contractors, it is important to be 
able to predict them accurately. The research performed in 
this thesis takes six government aerospace contractors and 
obtains regression models of their overhead costs that can 
be utilized for forecasting purposes. This method is prefer- 
able to some of the more commonly used methods because it 
estimates overhead costs directly, eliminating reliance upon 
predicted overhead rates. After the data were transformed 
to eliminate the effects of autocorrelation, excellent 
structural results were obtained for five of the six aero- 
space contractors. A Monte Carlo simulation was performed 
to compare various estimators of the autocorrelation. Two 
of the estimators were found to be superior. These two esti- 
mators are both two-stage estimators that are calculated 
utilizing Wallis's test statistic for fourth-order 
autocorrelation . 
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I. INTRODUCTION 



The purpose of this thesis is to analyze the overhead 
costs of six government aerospace contractors, determine the 
best estimators of autocorrelation found in the residuals, 
and then obtain regression models for the overhead costs. 
The most frequently used method to estimate overhead costs 
is to use estimated overhead rates. These rates are 
applied to estimated labor hours or costs in several func- 
tional categories. Summing all of these category values 
gives total overhead. This method results in poor 
estimation in cases where the firm's output fluctuates 
significantly as in the case of government aerospace 
contractors. Consequently, it is unsatisfactory for use with 
these six aerospace contractors. Another approach, the one 
utilized in this thesis, is to estimate overhead costs 
directly, and hence eliminate reliance upon overhead rates. 

For aerospace contractors, overhead cost comprises 30 to 
50 percent of total cost. Therefore, it is important to be 
able to accurately predict these costs. It is also desir- 
able that the predictive model be simple to utilize, without 
sacrificing predictive power, since personnel of various 
statistical backgrounds may be required to use it (Boger, 
1983, pp.5-7). 

The work performed in this thesis is a continuation of 
that presented in Boger (1984). The statistical methods and 
procedures used herein to obtain the predictive models have 
already been proven to give good, useful results. There are 
additions in two areas to the data used by Boger. First, 
data were obtained for two additional quarters and, 
secondly, data were obtained for one additional contractor. 
The major extension on Boger, and the emphasis of this 
thesis, is in determining the best method to use to estimate 
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autocorrelation. The focus of the analysis performed herein 
is to derive a simple and efficient regression model for 
overhead cost. 
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II. DATA SOURCES AND CHARACTERISTICS 



The data were obtained from six government aerospace 
contractors. To maintain confidentiality, all references to 
specific contractors will be with the labels A through F. 
Prior to obtaining the data, a specific format for data 
collection was selected to insure uniformity of data catego- 
ries among the contractors. The overhead cost data were then 
collected on a quarterly basis for the selected categories 
from each of the contractors, starting with the first 
quarter of 1979 through the fourth quarter of 1984. The 
data for the last two quarters of 1984 were unavailable for 
contractor E. The data for the last two quarters of 1984 
for contractor B were eliminated from further analysis 
because they were clearly outliers. In addition to the cost 
data, data pertaining to production and operating character- 
istics were obtained. 

There are three major categories, two of which are made 
up of several subcategories, which comprise total overhead 
costs. The first major category, labor related- costs , has 
subcategories of indirect salaries and fringe benefits. The 
second major category, facilities costs, includes all 
facilities-related costs. The last major category, the mixed 
costs category, has three subcategories. These three subca- 
tegories are Independent Research and Development and Bid 
and Proposal (IR&D/B&P) costs. Electronic Data Processing 
(EDP) costs, and a subcategory that contains all other costs 
related to overhead. 

The cost data were then converted from current dollars 
to constant fourth quarter 1984 dollars. This conversion was 
accomplished using Bureau of Labor Statistics (BLS) and 
Gross National Product Deflator (GNPD) indices for the 
appropriate categories. The labor- related costs were 
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converted using BLS SIC 372, the price index for production- 
worker average hourly wages for the aircraft and parts, 
industry. In this case the only indices available were 
monthly indices. The monthly indices were then averaged to 
obtain quarterly values. The facilities costs were adjusted 
using the GNPD gross private domestic fixed nonresidential 
investment index, published by the Bureau of Economic 
Analysis. The mixed costs were adjusted using the GNPD 
personal consumption services expenditure index. As with 
all indices, those used here are imperfect. They were 
selected because they should provide the best adjustments 
for inflation among all readily available and relevant 
indices . 

As previously mentioned, data in different production 
and operational categories were also obtained. The only one 
used in this analysis was the direct labor personnel 
category. Due to the nature of this data, no adjustment was 
necessary . 
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III. MODELING QUARTERLY OVERHEAD COST 



A. PRESENCE OF AUTOCORRELATION 

Whenever a statistical model is based upon time series 
data, as is the case herein, the residuals can be expected 
to exhibit some form of autocorrelation. Further, "the 
shorter the periods of individual observations, the greater 
the likelihood of encountering autoregressive disturbances" 
(Kmenta, 1971, p.270). Thus the presence of autocorrelation 
is more likely with quarterly observations than when the 
data are smoothed by reporting annual values. The most 
common assumption about the form of the autocorrelation of 
the errors terms is that the errors are first-order autore- 
gressive (Judge et . al., 1985, p.275). This model is called 
an AR(1) process and possesses the form 

et = PlEt-1 ^ «t, (3*1) 

where is the error term from a regression model corre- 
sponding to the observation at time t. The term is the 

coefficient of correlation between the related error terms, 
and of lag 1. The are normally and independently 

distributed random variables with mean 0 and constant 
2 

varxance • 

With autocorrelation the assumption that the error 

terms, are independent identically distributed normal 

^ ’ 2 
random variables with mean 0 and variance a is not true. 

They are related to previous error terms and depend upon the 
form of autocorrelation present. Secondly, when the auto- 
correlation is AR(1), the variance of the error terms is 
Qy^/Cl - pi^) (Kmenta, 1971, p.271). 

When the error terms are autocorrelated the least 
squares estimators of the regression coefficients are still 
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unbiased and consistent, but they are no longer efficient or 
asymptotically efficient. The standard estimates of the 
variances of the coefficients are also biased. When the 
error terms are positively autocorrelated , as is most common 
for economic time series data, this variance will be biased 
downward (Kmenta, 1971, pp. 278-283). This will cause the 
coefficient of determination, R-squared, and the t and 
F-statistics to be exaggerated (Maddala, 1977, p.283). The 
upward bias in each of these statistics leads to an unwar- 
ranted confidence in the regression model. 

The adverse effect of an overestimated value of 
R-squared is obvious since the higher the R-squared value, 
the better fit the model is assumed to give. An upwardly 
biased t-statistic makes it easier to reject, for any given 
level of significance, the null hypothesis that the regres- 
sion coefficient equals zero, thus, again giving the false 
impression of a more significant regression model. The 
effect of an exaggerated F-statistic is similar to that of 
the t-statistic. It is easier to reject the standard 
compound null hypothesis that all regression coefficients 
equal zero for the F-test. 

So, in addition to obtaining unbiased estimates of the 
regression coefficients, it is equally important to obtain 
unbiased estimates of their standard errors. Only then can 
reliable statistics be obtained that can accurately assess 
the quality of the regression model. 

As the AR(1) model indicates, the error terms in one 
period are related to those occurring one period prior. The 
AR(1) process is common in yearly economic data. When the 
data observations are quarterly, a special form of the 
fourth-order autoregressive, SAR(4), process will be present 
instead of the standard AR(4) process (Wallis, 1972, p.618). 
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This special SAR(4) process has the form 



Ej- = p4Et-4 Ot • (3.2) 

With this form of autocorrelation the error terms are 

related to those occurring in the corresponding quarters of 
successive years. Using the notation of Box and Jenkins 
this is a seasonal model of order ( 1 , 0 , 0 )x(0 , 0 , 0 )^ . The 

period equals four since the data is quarterly and can be 
expected to show seasonal effects within years (Box and 
Jenkins, 1970, pp. 301-305). When this SAR(4) process is 

p p 

present the variance of the error terms is Gy^/(1 " P4^) 

(Judge et . al . , 1985, p.298). This special SAR(4) process 

should be distinguished from the general AR(4) process which 
possesses the form 

et = Pl^t-l ^ P2et-2 ^ Pl^t-J P4St-4 «t • (^.3) 

The SAR(4) process used in this analysis considers the 
effect of the three previous quarters to be negligible 
compared to that of the same quarter of the previous year 
(Boger, 1983, p.l6). 

Time series plots of the dependent variable (Figures 
3.1a and 3.1b) total overhead costs, suggest that the Wallis 
SAR(4) model is appropriate for this problem. As can be 
seen, the data show a seasonal effect within years. The 
data appear to follow a yearly cycle in which the quarterly 
values of successive years fall in the same relative posi- 
tion with respect to the remaining three quarters of their 
respective years. 

The time series plots of the independent variable 
(Figures 3.2a and 3.2b), direct labor personnel, show that 
they all appear to follow some type of cycle or general 
trend. Contractors C, E, and F all appear to go through a 
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Figure 3.1a Time Series Plots of the Dependent Variable. 
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Figure 3.1b Time Series Plots of the Dependent variable. 
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Figure 3.2a Time Series Plots of the Independent Variable. 
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Figure 3.2b Time Series Plots of the Independent Variable. 
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single cycle. They start with an upward trend for approxi- 
mately eight quarters, followed by a gradual decline for 
about twelve quarters, and end with the start of another 
upward swing. Contractor A appears to run through two 
shorter cycles. Contractors B and D both appear to follow a 
gradual, steady upward trend. 

B. THEORETICAL MODEL UTILIZED 

The general model utilized for overhead costs in this 
analysis is of the form 



= XtJ3 + £t (3.4) 
£t = Pi£t-i ^ »t, (3-3) 

where is a tx2 matrix and ^ is a 2x1 vector. is the 
dependent variable, total overhead costs. The columns of X^ 
are the independent variable, direct labor personnel, 
preceded by a column of I's for the constant term. Only one 
independent variable is utilized because it satisfactorily 
explains the dependent variable and minimizes the complexity 
of the model. The error term has the structure shown in 
equation (3.5) where i is 1 for an AR(1) process, or 4 for 
Wallis's special SAR(4) process. The are normally and 
independently distributed random variables with mean 0 and 
constant variance. 

As previously mentioned, when autocorrelation is present 
the estimators of the regression coefficients are not 
efficient and their variances are biased. If p is a known 
quantity, then the X and Y variables can be transformed to 
eliminate the effect of the autocorrelation. A regression of 
these new transformed variables, the Generalized Least 
Squares (GLS) solution, yields results that correct these 
deficiencies. However p is seldom, if ever, known. This 
difficulty can be overcome by estimating p from the data. 
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This yields the Estimated GLS (EGLS) solution that also 
possesses the desired properties (Kmenta, 1971, p.284). 

C. ESTIMATORS OF FOURTH- ORDER AUTOCORRELATION 

One of the major thrusts of this analysis is to arrive 
at the most efficient estimate of . All of the estimators 
considered fall into one of two categories, iterative or 
maximum likelihood procedures. All but one of the iterative 
estimators can be more specifically classified as two-stage 
estimators. The theory behind maximum likelihood estimation 
can be found in most intermediate level probability and 
statistics texts. The basic approach to performing the 
iterative, including the more specific two-stage, procedure 
is presented in Kmenta (1971, p.288). The first seven esti- 
mators are two-stage estimators, while the eighth estimator 
utilizes the full iterative procedure. The ninth estimator 
is the maximum likelihood estimator. 

Since the AR(1) process is the most frequently encoun- 
tered form of autocorrelation, most of the estimators of 
used herein are simply adaptations of their corresponding p-j^ 
estimator. Judge et . al. (1985) derives six estimators for 
the AR(1) process case. Adaptations of the estimators of 
Judge are given below. 

The first estimator of p^ is the standard Prais-Winsten 
estimator 



^4 = 









(3.6) 



where e^=Y^-X^P, the residual from the OLS regression in 
equation (3.4). This estimator is simply the sample 
correlation coefficient when the residuals possess the auto- 
correlation process shown in equation (3.5) for i=4. The 
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AR(1) process version of this 
downwardly biased estimate of 
p.l89). So we could expect 
equation (3.6) to estimate p^. 

The second estimator of 

(T-K) 

P4 = ^ ’ 



estimator is known to give a 
pj^ (Park and Mitchell, 1980, 
this same result when using 

is 



(3.7) 



where T is the number of observations and K the number of 
parameters that must be estimated, in this case 2. This is 
simply a modification to equation (3.6) derived by Theil to 
incorporate a degrees -of-freedom correction. As can be seen 
it will further increase the downward bias of equation 
(3.6). 

The third estimator of p^ is 



^4=1- .5d4, (3.8) 

where d 4 is the Wallis test statistic for the special SAR(4) 
process. As presented in Wallis (1972) the equation for d 4 
is 



d4 = 



T 



- e 



t-4) 



til^t- 



(3.9) 



So equation (3.8) is easily computed once the test 
statistic, d 4 has been calculated. 
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The fourth estimator of 



P4 



IS 



^ T^(l - .5d4) t 

P4 ^ _ J.2 



(3.10) 



This estimator is a modification to equation (3.8) derived 
by Theil and Nagar. It is an improvement over equation (3.8) 
if the explanatory variables are smooth, "in particular, if 
the first and second differences of each explanatory vari- 
able are small in absolute value compared to the range of 
the variable itself" (Intriligator , 1978, p.l64). 

The fifth estimator of is the Durbin estimator. It is 
the coefficient of obtained in a regression of on 

Y|._ 4 , a constant, , and 

Yt - P4^t-4 * Pl^^”P4^ ^ Pz^t " PzP4^t-4 ^ ^t ’ (3-H) 

t = 5,6,. ..,T. 

The sixth estimator of is obtained from an OLS 

regression of e^. on 

= p4e^._4 + Oj- » t = 5,6,..., T. (3.12) 

The seventh estimator of p^ is an adaptation of the Park 
and Mitchell (1980) estimator: 



P4 



Z^e^-e 



t = 5 



t^t-4 



T-1 

t?5^' 



(3.13) 



As can be seen this is a modification of equation (3.6) 
where the summation in the denominator excludes the first 
four and the last observation. This will reduce the downward 
bias associated with equation (3.6). 
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The eighth . estimator of is the iterative 
Prais -Winsten estimator utilizing equation 3.13. This is the 
same as estimator seven except that the iterative procedure 
is carried out until convergence is achieved. For this 
estimator, as well as the maximum likelihood estimator that 
follows, convergence was defined to be consecutive estimates 
within I .000011 of each other. Each procedure was defined to 
be nonconvergent if convergence had not occured within 
fifty-two iterations (Park and Mitchell, 1979, pp.2-6). 

The maximum likelihood estimator, the ninth estimator 
of p 4 > was obtained using the iterative algorithm derived by 
Beach and MacKinnon (1978). The specific procedure they 
derived was for the AR(1) process, so again minor adjust- 
ments had to be made to tailor it to the SAR(4) process. The 
only alterations necessary were in the calculation of the 
coefficients of the polynomial 



f(p) = + Ap^ + Bp + C = 0. 



(3.14) 



The coefficients are now computed as follows: 



T 




/ DENOM 



(3.15) 




/ DENOM (3.16) 



T 

C = TJ^etet_4 / DENOM, 



(3.17) 



where the common denominator is 



DENOM = (T-l)(2 e.4^ - I e.^). 

t=5 ^ ^ t=l ^ 



(3.18) 
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The remainder of the algorithm is the same as that presented 
in Beach and MacKinnon (1978). Beach and MacKinnon adver- 
tise that this algorithm should converge in four to seven 
iterations for five digit accuracy. 

D. TRANSFORMATION FOR AUTOCORRELATION 

If first or fourth-order autocorrelation is shown to 
exist in the residuals, then the data must be transformed to 
eliminate its presence. The transformation used for an 
AR(1) process is discussed in Judge et.al. (1985, p.285). 
It should be noted here that only one estimator for was 
used, the two stage Prais -Winsten estimator derived in Park 
and Mitchell (1980) 



Pi = 






t = 2 



t^^t-l 



T-1 . 

t = 2 ^ 



(3.19) 



This is the AR(1) process version of equation (3.13). This 
particular estimator was selected because it was found to 
perform better than any other commonly used alternative, 
including the more standard Prais-Winsten estimator (see 
Judge et . al., 1985, p.286), in Park and Mitchell's 1980 

analysis . 

For the SAR(4) process the transformation is 

= Zt(l-p 42 )l/ 2 , t = 1,2, 3, 4 and (3.20) 

Z^ - - p^Z^ -4’ b=5,6,...,T, (3.21) 

where p^ must be estimated using any of its known estima- 
tors. Some estimators, of course, are more efficient than 
others. This is one item to be resolved for the specific 
models used herein. 
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Note that both of these transformations utilize all T 
observations. An alternative approach is to omit a number 
of the initial observations in the transformation (see 
Cochrane and Orcutt, 1949, p.35). The number of omissions is 
dependent upon the type of autocorrelation found present in 
the data. The first observation is omitted when the data are 
being transformed to eliminate the presence of an AR(1) 
process. The transformation for an SAR(4) process would 
result in the omission of the first four observations. 

It has been shown that the use of all T observations 
generally results in much more efficient results (Spitzer, 
1979, and Park and Mitchell, 1980). It should be noted that 
these results were arrived at by studying the AR(1) process 
transformation where only the first observation is involved. 
The effects should be even more dramatic for the SAR(4) 
process transfoirmat ion , since the first four observations 
are involved. It is also worth noting that the relative 
efficiency of these two alternative transformations is 
related to the specification of the independent variable 
(Maeshiro, 1979 and Taylor, 1981). Maeshiro found that in 
the case where the independent variable is trended and p=0 
(as is most commonly the case with economic data) the reten- 
tion of all T observations greatly increased the efficiency 
of the estimator. He also found that retention of the 
initial observations was not as critical for the case of an 
untrended independent variable. 

E. PROCEDURE 

The general procedure followed herein was to first 
perform an Ordinary Least Squares (OLS) regression with 
direct labor personnel as the independent variable and total 
overhead cost as the dependent variable. Direct labor 
personnel was selected as the independent variable because 
it was shown to perform the best among numerous explanatory 
variables in a single variable regression with total 
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overhead cost in Boger's 1983 analysis. The residuals were 
then analyzed and tested for the presence of Wallis's 
special SAR(4) process, the AR(1) process, or a combination 
of both. To do this, the autocorrelation function of the 
residuals was first looked at to get an overall picture of 
the type of autocorrelation present. More formal testing 
was then performed. 

The Durbin-Watson test was used to check for the pres- 
ence of the AR(1) process (see Kmenta, 1971, pp. 295-296). 
The Wallis test, a generalization of the Durbin-Watson test, 
was utilized to check for the presence of the SAR(4) process 
(see Wallis, 1972, pp. 624-625). In both cases a two-sided 
test was performed using the null hypothesis Hq : p=0, versus 
the alternative p^O. A significance level of size a= • 10 
was used to define the critical region. 

One problem with both of these tests is the inconclusive 
region between the upper and lower significance points that 
determines the critical region. The size of this inconclu- 
sive region increases as the sample size decreases or as the 
number of regressors increases (Wallis, 1972, p.625). So in 
this analysis we are handicapped by the small sample size, 
but it is to our advantage here in keeping the number of 
regressors to a minimum. Maddala (1977, pp. 285-286) presents 
numerous suggestions, derived by others, in dealing with 
this inconclusive region for the Durbin-Watson test. In this 
analysis we chose the statistically conservative approach of 
ignoring the lower significance point and using only the 
upper point to define the critical region. This rule was 
followed for both the Durbin-Watson and Wallis tests. This 
method is said to perform well in many situations for the 
Durbin-Watson test (Draper and Smith, 1981, p.l67). As 
presented in Draper and Smith (1981) the rejection criteria 
for the two sided test for this rule are as follows; if d<d^^ 
or 4-d<d^j, reject Hq at level 2a • Any point that would have 
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fallen in the inconclusive region before, would now fall in 
the critical region and lead to the rejection of our null 
hypothesis. This treatment of the inconclusive region is 
also recommended by Hannan and Terrel (1968), They feel that 
the upper significance point is a good approximation for the 
bound on the critical region when the regressors are slowly 
changing. They further state that economic time series, as 
is the case here, are slowly changing so the upper signifi- 
cance point can be used as the lone significance point for 
the Durbin- Wat son test (Maddala, 1977, p.285). In another 
study, Theil and Nagar (1961) computed significance points 
for the Von Neumann ratio of least-squares estimated regres- 
sion disturbances , which is equivalent to the Durbin-Wat son 
test statistic. Their calculated significance points were 
very close to the upper significance points for the 
Durbin-Wat son test. So this also gives added credence to 
using the upper point as the sole significance point in 
performing these tests. Though all of the referenced 
results apply to the Durbin- Wat son test, this rule was used 
on the Wallis test also because, as previously mentioned, 
the Wallis is a slight modification of the Durbin-Watson 
test . 

Next, depending upon the form of autocorrelation found 
present, the data were transformed using the appropriate 
transformation. The EGLS solution was then obtained by 
reestimating the model using the transformed dependent and 
independent variables. Again the residuals of this regres- 
sion were tested for the presence of autocorrelation. This 
cycle of reestimating the model and testing for autocorrela- 
tion was performed until a model was obtained where the 
residuals were free from any autoregressive process. Once 
this final model was obtained, the residuals were checked to 
insure that they were independent, identically distributed. 
Normal random variables with zero mean and constant 
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variance. In all cases this requirement was met. 
final models met all of the necessary assumptions 
of a correct, reliable regression model. 



So the 
required 
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IV. SMALL- SAMPLE PROPERTIES OF SEVERAL ESTIMATORS OF SAR(4) 



A . GENERAL 

As previously mentioned, one of the purposes of this 
thesis is to determine the best estimators of fourth-order 
autocorrelation. This is important because the performance 
of the EGLS regression is dependent upon the quality of the 
estimator (Rao and Griliches, 1969, p.258). In order to 
evaluate the nine estimators presented in Chapter 3, a Monte 
Carlo simulation was carried out to determine their relative 
performances. This chapter explains the simulation and 
presents the results. The three estimators that performed 
the best in this simulation were then used to obtain struc- 
tural models for each contractor. These three models then 
provided another basis of comparison for the three final 
estimators. The computer programs utilized in the simula- 
tion are contained in the appendix. 

B. RESULTS FROM PREVIOUS MONTE CARLO ANALYSIS 

No previous simulations comparing estimators of fourth- 
order autocorrelation could be found. Therefore the results 
of this simulation could only be compared with results 
obtained from previous simulations that evaluated various 
estimators of first-order autocorrelation. Even though 
these past Monte Carlo simulations dealt with the AR(1), 
instead of SAR(4), process estimators, their results should 
still be useful in predicting the relative performance of 
the various estimators of . Of the nine estimators evalu- 
ated in this thesis, results comparing the AR(1) process 
versions of only estimators one, five, seven, eight and nine 
could be found. As will be shown later in this chapter, 
estimators three and four proved to be the best of the nine 
estimators tested herein. It would have been interesting to 



29 



see how their first-order autocorrelation versions would 
have compared in these previous studies. 

The Spitzer and Rao and Griliches studies compared the 
Durbin and standard Prais -Winsten estimators of first-order 
autocorrelation for Mean Squared Error (MSE) of P2* both 
analyses the Prais-Winsten estimator performed better for 
lower positive values (p]^<-5) while the Durbin estimator 
dominated for the higher values. The Spitzer study also 
included the maximum likelihood estimator and showed that it 
was better than both the Durbin and Prais-Winsten estimators 
for for MSE of P£ > ^.nd better than the Durbin esti- 

mator for p]^>*3 for MSE of p^ (it wasn't compared to the 
Prais-Winsten estimator for MSE of p]^). Park and Mitchell 
compared four estimators in their 1980 analysis for RMSE of 
and p£. The estimators they analyzed were the iterative 
Prais-Winsten, the maximum likelihood, their version of the 
Prais-Winsten (Equation (3.19)), and the standard 
Prais-Winsten estimators. Their maximum likelihood estimator 
was computed utilizing Beach and MacKinnon's algorithm. The 
iterative Prais-Winsten estimator was the best of the four 
with a slight edge over the maximum likelihood estimator. 
Since the iterative Prais-Winsten estimator outperformed its 
two-stage counterparts it was shown that iteration leads to 
a more efficient estimator. Of the two stage estimators, 
their version of the Prais-Winsten estimator was better than 
the standard version. Park and Mitchell's 1979 study showed 
that the iterative Prais-Winsten was also better than the 
maximum likelihood estimator for MSE of p-j^. All of these 
studies showed that all of the estimators outperformed the 
OLS solution when a significant amount of autocorrelation 
was present in the residuals (pQ^>.2). 
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C . THE MODEL 



The model utilized in the simulation was 



= Pi + P2Xt ^ St 



(4.1) 



8t P4^t-4 ^ ^t ’ 



(4.2) 




I I <1.0. Three different independent variables were 



utilized in the simulation. They were the direct labor 
personnel for contractors A, B, and E. These were selected 
because they were three of the models where the SAR(4) 
process was found to be the most significant form of auto- 
correlation present in the residuals. A separate simulation 
run was performed for each of these so that the relative 
performance of the estimators could be compared for indepen- 

p 

dent variables with different structures. The value of <j^ 
was particular to the contractor for which the simulation 
run was performed. 

It was desired that the generated dependent variable, 
, be comparable in value and structure to the total 
overhead cost for the respective contractor (the dependent 
variable, Y^. , in equation (3.4)). Therefore the terms of 

equation (4.2) had to be proportional to the terms of 

p 

equation (3.5). To accomplish this, the value of was the 

variance of the residuals, > obtained from the OLS regres- 
sion of e^ on e^-_^: 



The variables e^ and were the (unlagged and lagged) 
residuals obtained from the OLS regression in equation 
(3.4). The sample size, T, was simply the number of data 
observations for each contractor, twenty-four for contractor 
A and twenty-two for contractors B and E. Each simulation 
was replicated one hundred times. 



®t = P4®t-4 



(4.3) 
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D. DATA GENERATION 



For each simulation the independent variable, Xj. , the 
regression coefficients, and P 2 , and were predeter- 
mined, fixed values. As previously mentioned the three 
different independent variables were the direct labor 
personnel for contractors A, B, and E. Each simulation was 
run for values of of .1, .2, .3, .4, .5, .6, .7, .8, .9 
and .95. The value of p^ was restricted to positive values 
because this is the region most likely to be encountered for 
the SAR(4) process with economic data. As in the simulation 
performed by Rao and Griliches in 1969, the constant term, 
P]^, was set at zero. The value of the slope, p£, was then 
set at a value that generated dependent variables, , 
proportional in size to the respective contractor’s total 
overhead cost. The value of P 2 for each simulation, is 
contained in Table 1. 



TABLE 1 

SIMULATION PARAMETERS 



Simulation 


run 


P2 


^0 


Contractor 


A 


17 


13454 


Contractor 


B 


17 


8835 


Contractor 


E 


14 


3444 



With the ultimate goal of determining Y^. , the data were 
generated using the following algorithm: 

(1) Thirty-two values were randomly generated from a 

p 

normal distribution with mean 0 and variance • The value 
of the standard deviation, (jy > for each simulation, is shown 
in Table 1. New values were drawn for each of the 100 

replications . 
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(2) The first four values of were computed to be 

= Ut / t = 1,2, 3,4. (4 .4) 

This generated error terms from the normal distribution with 
mean 0 and variance ( 1 “P 4 ) • 

(3) The next twenty-eight (twenty-four for contractors B and 
E) values of were generated from 

— P4^t-4"*^^t’ ^ ~ 5,6,...,T. (4.5) 

A total of T + 8 values of were generated. The first 

eight of these values were dropped so that the first four 
values, generated by step (2), didn't excessively influence 
the sample (Spitzer, 1979, p.46). This left us with T 

values of e^- that possess the SAR(4) process shown in 
equation (4.2). 

(4) Using P]^=0, the respective P2> ^t g^rie^^^-ted 

A 

above, the dependent variable , was generated as follows 

= Pi * P2^t * £f ('*•«) 

An OLS regression was then performed with direct labor 
personnel as the independent variable and the dependent 

variable. The residuals from this regression were then used 
to compute the nine estimators of p^. As in Park and 
Mitchell’s 1979 analysis any estimator that equaled or 
exceeded 1.0 was reset to .99999. For the two iterative 
algorithms, the iterative Prais-Winsten and the maximum 
likelihood, if two consecutives estimates exceeded 1.0 they 
were both reset to .99999 and convergence was declared. The 
results for these two estimators are only for the cases when 
convergence was attained. 
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E. MEASURES OF EFFECTIVENESS 

Three MOEs were used to determine the relative perform- 
ances of the estimators, the RMSE of , the RMSE of the 
slope coefficient, the adjusted R-squared value. 

Each MOE was computed and averaged over the one hundred 
replications for each value of p^. 

To evaluate how accurately each estimator predicted p^ 
and its variance the RMSE of p^ was computed (Rao and 
Griliches, 1969). The equation is 

RMSE (p4) = tf°(^4-p4)^/100]l/2. (4.7) 

The RMSE of evaluated the estimators in terms of 

their performance for the slope coefficient, ^2 make 

comparisons easier, the performance of each estimator (EGLS 
solution) relative to that of the OLS solution was computed. 
As presented in Park and Mitchell (1980) the relative effi- 
ciency is 

RMSE (Bo, OLS) 

Rel. Eff. (Bo, estimator.:) = ^ (4.8) 

^ RMSE (p2, EGLS^) 



where 



RMSE (P 2 ) = / 100] 1/^. 



(4.9) 



The last MOE utilized was the adjusted R-squared value 
from each EGLS regression. Though three MOE's were 
utilized, the RMSE of p^ was considered the most important. 
The other two were considered only if the RMSE of p^ was 
sufficiently close for alternative estimators. 
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F. RESULTS 



1 . General 

The results for the three simulations are summarized 
in Tables 2 through 4 and Figures 4.1a through 4.3c. The 
three different simulation runs are identified by the 
contractor label. Since each simulation was run with a 
different independent variable, each having its own unique 
structure, the results vary slightly between simulations. 

The two iterative procedures had slight convergence 
problems at low values of (p 4 <- 3 ). This could possibly 
have occured because at low values of the error terms , 
e^. , still do not have a significant SAR(4) process struc- 
ture. So the first estimate could be a poor one and the 
iterative procedure could proceed in the wrong direction 
(most likely toward negative values of P 4 ). The convergence 
problem decreased as p 4 increased, such that by p 4=.7 
convergence occured almost 100 percent of the time. For both 
estimators convergence generally occured in four to six 
iteratiohs. 

2 . Estimation of Rho 

The RMSE of p^ was used to determine which estima- 
tors provided the best estimate of P 4 . As can be seen in 
Table 2 and Figures 4.1a through 4.1c, no estimator was 
uniformly the best. Estimators three and four, the two 
estimators that utilized the Wallis test statistic, 
appeared to be vastly superior over the entire range of p 4 . 
They were only outperformed at the extreme low end by 
estimators one and two, the two versions of the sample auto- 
correlation coefficient. Estimator three was the best in the 
range « 2 <p 4 <. 5 . A crossover occured at p^ equals .6 and 
estimator four dominated for the upper range of p^. An 
exception to this was for the contractor A simulation where 
estimator nine was the best for p 4 =. 95 . Estimator nine, the 
maximum likehood estimator, appeared to be the third most 
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TABLE 2 



RMSE OF RHO FOR THE THREE SIMULATION RUNS 



Contractor A 



P4 


1 


2 


Estimator 
3 4 5 


6 


7 


8 


9 


.1 


. 198 


.191 


.195 .2 


.218 


.226 


.244 


.193 


.224 


.2 


.183 


.18 


.17 .175 


.214 


.2 


.22 


.2 


.217 


.3 


. 204 


.203 


.171 .175 


. 209 


.209 


.227 


. 213 


.226 


.4 


.209 


.212 


.154 .157 


. 198 


. 207 


. 209 


. 198 


.214 


. 5 


.233 


.24 


.153 .154 


.199 


. 214 


.218 


.209 


.21 


.6 


.286 


. 295 


.184 .181 


.235 


.259 


.253 


.228 


.213 


.7 


.33 


. 342 


.207 .203 


.26 


.288 


.284 


.254 


.236 


. 8 


.286 


.302 


.162 .16 


.217 


.235 


. 217 


.193 


.167 


.9 


.283 


.305 


.134 .129 


.195 


.213 


. 184 


. 163 


.126 


.95 


.243 


.27 


.097 .092 


.133 


.149 


.134 


. 112 


. 070 



P4 


1 


2 


Contractor B 
Estimator 
3 4 5 


6 


7 


8 


9 


.1 


.212 


.204 


.215 


.22 


.242 


.248 


.256 


.216 


.242 


.2 


. 224 


. 219 


.204 


.208 


. 249 


.255 


.257 


. 201 


.229 


.3 


.241 


. 239 


. 184 


. 188 


.257 


.256 


.26 


. 206 


.217 


.4 


.285 


. 284 


. 216 


.217 


. 271 


.286 


.305 


. 249 


.244 


. 5 


.278 


.283 


. 193 


.193 


.265 


.283 


.278 


.24 


. 246 


.6 


. 264 


.274 


. 164 


. 164 


.213 


.228 


.234 


.225 


.217 


.7 


.324 


.337 


.196 


.192 


.252 


.275 


.287 


.26 


. 233 


. 8 


.302 


.319 


.172 


. 169 


.222 


.245 


.236 


.207 


. 18 


.9 


.29 


.314 


.141 


.137 


.203 


.216 


.192 


.176 


.15 


.95 


.254 


.283 


. 105 


. 101 


. 144 


.153 


. 148 


. 131 


. 106 



P4 


1 


2 


Contractor E 
Estimator 
3 4 5 


6 


7 


8 


9 


. 1 


.21 


.202 


.228 .234 


.265 


.252 


.263 


.236 


.264 


. 2 


.213 


. 208 


.213 .219 


.263 


.245 


.253 


.224 


.2 53 


. 3 


.23 


.228 


.185 .191 


.253 


. 244 


.258 


.226 


. 24 


. 4 


. 271 


. 272 


.204 .207 


. 286 


. 275 


. 292 


. 261 


. 268 


, 5 


. 26 


.265 


.174 .176 


.269 


.261 


.262 


. 229 


. 237 


. 6 


.252 


.263 


.154 .156 


.212 


.214 


.22 


.215 


. 216 


. 7 


.314 


.327 


.174 .171 


.254 


.261 


.27 


.256 


.23 


. 8 


.302 


.32 


.159 .156 


.228 


. 246 


. 229 


. 205 


. 179 


. 9 


.286 


.311 


.125 .12 


. 196 


.206 


. 169 


.152 


. 12 


.95 


.259 


.288 


.105 .101 


.15 


.156 


. 148 


. 13 


. 106 
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Figure 4.1a RMSE of Rho for Simulation for Contractor A. 
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Figure 4.1b RMSE of Rho for Simulation for Contractor B. 
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Figure 4 . Ic 



RMSE of Rho for Simulation for Contractor E 



efficient, with the iterative Prais-Winsten estimator a 
close fourth. It was interesting to compare the performance 
of the standard Prais-Winsten estimator, estimator one, with 
that of Park and Mitchell's version, estimator seven. 
Estimator one was better in the lower range (p^<-5) with 
estimator seven better from that point on. Overall though, 
it appeared that estimator seven was the better of the two. 
It could also be observed that all estimators except one and 
two improved as increase. Estimators one and two 
performed better at the lower end (p 4 =-l). Recall that 
these two estimators are known to be downwardly biased in 
their AR(1) process forms. 

3 . Estimation of Slope Coefficient 

Most noteworthy was the fact that all estimators 
provided more efficient estimation than did the OLS solution 
for p^>.2. As can be seen in Table 3 and Figures 4.2a 
through 4.2c, none of the estimators dominated over any 
significant range of p^. The two best performers appeared 
to be estimators eight and nine. These two estimators poss- 
essed a slight edge over estimators three and four. The 
performances of the remaining estimators, except for estima- 
tors one and two which were clearly inferior, were very 
comparable and no significant distinctions could be drawn. 

4 . EGLS Regression Quality 

The adjusted R-squared value was selected as the MOE 
to evaluate the quality of the EGLS regression. Again, as is 
shown in Table 4, all estimators provided better results 
than did the OLS solution. As can also be seen in Figures 
4.3a through 4.3c estimators three and four were again 
vastly superior. This time estimator four performed the best 
until it was surpassed by estimator three at approximately 
p^ equal .8. Estimator nine again appeared to be the third 
best performer. Except for estimators one and two, which 
were again slightly inferior, the remaining estimators were 
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TABLE 3 



RELATIVE EFFICIENCIES FOR THE THREE SIl^LATION RUNS 



Contractor A 
Estimator 



P 4 


1 


2 


3 


4 


.1 


1.05 


1.05 


1.02 


1.02 


.2 


1.07 


1 . 07 


1.02 


1.02 


.3 


1.05 


1.04 


1.08 


1.08 


.4 


1.1 


1.09 


1.08 


1.08 


.5 


1.14 


1.13 


1 .21 


1.21 


. 6 


1 .25 


1.24 


1.33 


1.33 


.7 


1.19 


1 .18 


1 . 21 


1.21 


.8 


1.31 


1.3 


1.37 


1.37 


.9 


1.56 


1 . 54 


1.69 


1.7 


.95 


1.38 


1.37 


1.41 


1.41 



5 6 7 8 9 



1.06 


1 . 07 


1.08 


1.02 


1.02 


1.04 


1 .08 


1.07 


1.08 


1.07 


1 . 06 


1.05 


1.05 


1.03 


1.01 


1 .11 


1 . 11 


1.1 


1.09 


1.08 


1.17 


1 .18 


1.18 


1.19 


1.22 


1 .33 


1.28 


1.29 


1.35 


1.35 


1.19 


1 . 2 


1 . 19 


1 . 2 


1 .19 


1.37 


1 . 34 


1.36 


1.38 


1 .38 


1 .71 


1.66 


1.67 


1.71 


1.74 


1.41 


1.4 


1.41 


1.42 


1.43 



P 4 1 2 



Contractor B 
Estimator 
3 4 5 6 



7 8 



9 



.1 
.2 
.3 
.4 
.5 
. 6 
.7 
.8 
.9 
.95 



0 . 


,93 


0 , 


,93 


0 . 


,97 


0 . 


,94 


0 . 


,94 


0 . 


,95 


1 . 


,03 


1 . 


,03 


1 . 


,01 


1 . 


,05 


1 . 


,05 


1 




1 , 


.02 


1 , 


.02 


1 . 


, 04 


1 , 


. 06 


1 , 


. 06 


1 . 


.08 


1 , 


.03 


1 , 


.03 


1 , 


.06 


1 , 


.05 


1 , 


.05 


1 , 


.04 


1 . 


.12 


1 , 


. 12 


1 , 


. 14 


1 , 


.16 


1 , 


.15 


1 , 


.18 



0.97 


0.96 


0.95 


0.96 


0.96 


0.99 


1.01 


1.03 


1.04 


1 


1.01 


1.04 


1.04 


1.04 


1.02 


1.08 


1.06 


1.07 


1.06 


1.06 


1.04 


1.03 


1.04 


1.05 


1.14 


1.15 


1.15 


1.18 


1.17 


1.17 



0.98 0.98 0.97 
0.97 1.07 1.08 
1.04 1.05 1.04 

1.03 1.02 1.01 

1.02 1.06 1.05 
1.08 1.07 1.08 

1.03 1.08 1.09 

1.03 1.02 1.03 
1.12 1.13 1.14 
1.18 1.18 1.18 



Contractor E 
Estimator 



12 3 4 



.1 


1.02 


1.02 


0.99 


0.98 


.2 


1.06 


1 . 06 


1.01 


1.01 


.3 


1 


1 


1 


1 


.4 


1.03 


1.02 


1.08 


1 . 08 


. 5 


1.06 


1 . 06 


1.15 


1.15 


. 6 


1.13 


1.13 


1.15 


1 .16 


.7 


1 .12 


1.11 


1.18 


1.18 


.8 


1.1 


1 . 1 


1.12 


1.12 


.9 


1.27 


1.27 


1.27 


1 . 27 


.95 


1.37 


1.37 


1.36 


1.36 



5 


6 


7 


8 


9 


1 .02 


1.02 


1.01 


1.09 


1.08 


1.06 


1.07 


1.07 


1 . 04 


1.04 


1.02 


1.01 


0.99 


1.1 


1 . 11 


1.07 


1.04 


1.04 


1.06 


1 . 07 


1 .11 


1.07 


1.07 


1 .11 


1.12 


1 .15 


1 . 14 


1 . 14 


1.15 


1.15 


1.15 


1 . 14 


1.15 


1 . 16 


1.17 


1 .12 


1 . 1 


1 . 1 


1.09 


1.09 


1.27 


1.27 


1 .28 


1 . 28 


1.27 


1.36 


1.36 


1.36 


1.36 


1.36 
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Figure 4.2a Relative Efficiencies 
for Simulation for Contractor A. 
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Figure 4.2b Relative Efficiencies 
for Simulation for Contractor B. 
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Figure 4.2c Relative Efficiencies 
for Simulation for Contractor E. 
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ADJUSTED R 


TABLE 4 

-SQUARED FOR THE THREE SIMULATION RUNS 


P4 


1 


2 


Contractor A 
Estimator 
3 4 5 


6 


7 


8 


9 


.1 


.41 


.397 


.509 .514 


. 443 


.421 


.422 


.451 


.473 


.2 


.433 


.426 


.596 .605 


.519 


.46 


.466 


. 505 


.53 


.3 


. 521 


.512 


.669 .677 


. 59 


.556 


. 569 


. 593 


.615 


.4 


.604 


. 594 


.745 .751 


. 701 


.651 


.655 


.691 


.721 


.5 


.644 


.633 


.764 .769 


.716 


.689 


.702 


. 717 


.741 


• 6 


.667 


.654 


.771 .775 


.741 


.709 


.721 


. 742 


. 768 


. 7 


. 705 


.693 


.795 .797 


.758 


.739 


. 744 


. 763 


. 777 


. 8 


.765 


.755 


.805 .802 


.762 


.764 


. 767 


.755 


. 769 


.9 


. 768 


.757 


.779 .766 


. 703 


.719 


.7 


.68 


.697 


.95 


.751 


.737 


.722 .685 


.656 


.687 


. 592 


. 581 


. 6 


P4 


1 


2 


Contractor B 
Estimator 
3 4 5 


6 


7 


8 


9 


.1 


. 574 


. 569 
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Figure 4.3a Adjusted R-squared 
for Simulation for Contractor A. 
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Figure 4.3b Adjusted R- squared 
for Simulation for Contractor B. 
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Figure 4.3c Adjusted R-squared 
for Simulation for Contractor E. 
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all relatively comparable in their overall performance. 
Estimators one and two were the best at the extreme high end 
(p^=.95) in two of the simulations however. In comparing the 
two Prais-Winsten estimators, estimator seven performed 
better for P4<*7, with estimator one better for p4>-7. 
Again estimator seven appeared to be the better of the two 
estimators . 

5 . Summary 

All of these estimators provided an improvement over 
the OLS solution when the SAR(4) process was present in a 
significant amount (p4>.2). This mirrors the results of the 
previous Monte Carlo analyses for the AR(1) process cited 
earlier in the chapter. This simulation indicated that 
estimators three and four were the best estimators of p^ . 
Different results may be obtained for differently structured 
independent variables. The performances of estimators three 
and four were nearly identical. The selection of one over 
the other may be based upon the smoothness of the explana- 
tory variable or the approximate value or range of p^ , if 
known. It was expected beforehand that the maximum likeli- 
hood and iterative Prais-Winsten estimators, the two most 
time costly estimators, would have outperformed all of the 
other estimators. They did in fact finish third and fourth, 
with estimator nine proving to be slightly better than esti- 
mator eight. It is worth noting again that none of the 
previous AR(1) process studies compared the AR(1) process 
versions of estimators three and four. Therefore it was 
believed that, like all of the other two-stage estimators, 
they would have been outperformed by the maximum likelihood 
and iterative Prais-Winsten estimators, as was shown to be 
the case in the previous studies. The result that estimator 
nine was better than estimator eight was the opposite of 
that obtained in the previous AR(1) process studies, but in 
both cases their performances were very close. As in the 
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previous studies it was proven that iteration leads to a 
more efficient estimator, as the iterative Prais - Winsten 
estimator was again shown to be better than its two-stage 
counterparts. Of the two Prais-Winsten estimators, esti- 
mator seven (Park and Mitchell's version) appeared to be 
better overall. This agreed with Park and Mitchell's find- 
ings in their analysis of first-order autocorrelation esti- 
mators . Knowledge of the approximate range of beforehand 
would help in selecting the more appropriate of these two 
estimators. In the lower region estimator one was better, 
while estimator seven dominated in the upper region. The 
next chapter presents and compares the final regression 
models obtained for the three estimators, numbers three, 
four, and nine. 
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V. STRUCTURAL ANALYSIS 



A. GENERAL 

In this chapter the procedures outlined in chapter three 
were followed to obtain EGLS regression models for each 
contractor. A separate model was obtained using each of the 
three preferred estimators from the previous chapter, esti- 
mators three, four, and nine. All of the models are 
presented for comparison. Due to the large number of models 
obtained the entire procedure is illustrated in detail for 
only one, contractor A's EGLS model for estimator three. 
Only the final results of the other models are presented. 
In all cases direct labor personnel was utilized as the 
explanatory variable and total overhead costs as the 
dependent variable. The computer programs utilized in the 
structural analysis are contained in the appendix. 

B. PROCEDURE 

Table 5 presents the results of these procedures applied 
to the regression of total overhead costs for contractor A 
(TOTOHA) on direct labor personnel for contractor A 
(DIRPERA) . The results of this initial regression indicated 
very poor results. The adjusted R-squared value was very 
low and the F-statistic (not including constant term) was 
very close to its five percent critical value of 4.32 even 
though both were inflated due to the presence of autocorre- 
lation. The low R-squared value indicated that the 
regression equation explained little beyond the mean of the 
dependent variable (Boger, 1983, p.21). Though biased 
downward, also due to the presence of autocorrelation, the 
standard errors of the regression coefficients were still 
large relative to the magnitude of the coefficients . 
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TABLE 5 




RESULTS FOR CONTRACTOR A 


Model: TOTOHA = a + 


bDIRPERA 




Untransformed Data 


Standard Error of the Regression: 
Adjusted R-sguared: 

F-statistic (degrees of freedom): 
Estimate of a: 

Standard Error: 

Estimate of b: 

Standard Error: 

Durbin- Watson Test Statistic: 
Wallis Test Statistic: 

Estimator Three: 

Estimator Four: 

Estimator Nine: 


15000. 

.1504 

5.072 (1,22) 

76270. 

63930. 

12.06 

5.355 

1.688 

.5480 

.7260 

.7381 

.7843 




Transformed Data 


Estimator Three 




Standard Error of Regression: 
Adjusted R-squared: 
F-statistic ; 

Estimate of a: 

Standard Error: 

Estimate of b: 

Standard Error: 


9503. 

.8660 

149.7 

69090. 

33980. 

12.18 

2.787 


Estimator Four 




Standard Error of Regression: 
Adjusted R-squared: 
F-statistic : 

Estimate of a: 

Standard Error: 

Estimate of b: 

Standard Error: 


9468. 

.8649 

148.3 

68650. 

33740. 

12.20 

2.760 


Estimator Nine 




Standard Error of Regression: 
Adjusted R-squared: 
F-statistic: 

Estimate of a: 

Standard Error: 

Estimate of b: 

Standard Error: 


9361. 

.8571 

138.9 

67224. 

33230. 

12.26 

2.687 
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The autocorrelation function of the residuals obtained 
from this regression (Figure 5.1), with its single signifi- 
cant spike at lag four, strongly suggested the presence of 
Wallis's SAR(4) process. Upon formally testing the residuals 
for the presence of this SAR(4) process, the null hypothesis 
that no fourth-order autocorrelation was present was clearly 
rejected. At this point the Durbin- Watson statistic was 
insignificant . 




Figure 5.1 Autocorrelation Function of the Residuals 

for Contractor A. 

As can be observed, the three calculated estimators of 
were relatively close. The data were then transformed and 
the model reestimated. This step was performed three times 
in order to obtain a reestimated model for each of the esti- 
mators of p^. The residuals of the new model were then 
analyzed for the presence of autocorrelation. In all three 
cases the Durbin-Watson statistic proved to be significant. 
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indicating the presence of first-order autocorrelation. The 
data were again transformed, this time to eliminate the 
AR(1) process (using the calculated estimator of p]^), and 
the model was reestimated. Examination of the residuals from 
this model resulted in the finding that both the 
Durbin- Watson and Wallis test statistics were insignificant, 
indicating that neither the AR(1) nor SAR(4) processes were 
present in the residuals. The autocorrelation function of 
these residuals (Figure 5.2) also showed that no autoregres- 
sive process was present at any level. 




Figure 5.2 Autocorrelation Function of the Residuals 
for Contractor A's model (estimator 3). 

The resulting model now had residuals that were free of 
autocorrelation. The residuals were then analyzed to deter- 
mine if all of the assumptions required for the regression 
were satisfied. Note that this does not mean that it has 
been concluded that the assumptions are all necessarily 
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correct. It merely means that "on the basis of the data we 
have seen, we have no reason to say that they are incorrect" 
(Draper and Smith, 1981, p.l42). 

Three tests were performed to check the normality of the 
residuals. An empirical cumulative distribution function 
(CDF) was generated to compare the CDF of the residuals with 
that of the appropriate distribution. In this case the 
appropriate distribution was normal with mean zero and 
standard deviation 9099. A probability (Q-Q) plot of the 
residuals was also generated which plotted the quantiles of 
the residuals against the corresponding quantiles of the 
appropriate distribution. Each of these plots (Figure 5.3) 
were bounded by the ninety- five percent Kolmogorov- Smirnov 
(K-S) confidence boundaries. Both of these plots lie 
completely within the K-S boundaries, supporting the assump- 
tion that the residuals were distributed normally with mean 
zero and standard deviation 9099. To more formally test this 
hypothesis a Kolmogorov- Smirnov goodness-of - f it test was 
performed with null hypothesis Hq : F(x) = F*(x), versus 

F(x)^F*(x), where F*(x) is the normal distribution with zero 
mean and standard deviation 9099 and F(x) is the unknown 
distribution function of the data. As presented in Conover 
(1980, p.347) the test statistic for the K-S test is simply 
the greatest distance between the cdf, F*(x), and empirical 
cdf of the data. It is possible to obtain this test 
statistic directly from the normal cdf plot in Figure 5.3. 
The test was performed using a significance level of size 
a=.05. The K-S test statistic was significant at a level of 
.9693. Therefore, the null hypothesis could not be rejected. 

Two tests were performed to test the constant variance 
(homogeneity of variance) assumption. The residuals were 
plotted against the predicted dependent variable (Figure 
5.4) to see if any obvious abnormalities could be observed. 
The dispersion of the residuals appears to be fairly random 
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PERCENTILE CUMULATIVE PROBABILITY 



NORMAL CUMULATIVE DISTRIBUTION FUNCTION 




NORMAL PROBABILITY PLOT 




Figure 5.3 Tests for Normality of Residuals 
for Contractor A's model (estimator 3). 
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Figure 5.4 Tests for Homogeneity of Variance of the Residuals 
for Contractor A's model (estimator 3). 

and the variance relatively constant throughout. An F test 
was then performed. The residuals were divided into two sets 

O O 

and the null hypothesis Hq.* -G2 '^^.s tested against 

o o o o 

^1 ^^2 ’ where and 02 were the variances of the two 

sets of residuals. As presented in Mood, et.al.(1974, 
p.438) the test statistic for this F-test is 



("2 - l)S(Xii - ^ 1 ) ^ 

(ni - l)S(X2i - X2)^’ 



(5.1) 



where X^ and X 2 are the population expected values. This 
test statistic has the F distribution with degrees of 
freedom n^^-l and n 2 ~l when - 02 '^* This test was also 
performed using a significance level of size a = .05. The 

test statistic was significant at a level of .5154. 
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Therefore the constant variance assumption could not be 
rej ected . 

These tests could not reject the assumption that the 
residuals were in fact normally distributed random variables 
with zero mean and constant variance. 

The results for this regression indicated that the 
model contained a great deal of information on overhead 
costs. The R-squared value was significantly high indicating 
that the model contained much more information than just the 
mean of the dependent variable. The standard errors of the 
regression coefficients, especially that of the slope term, 
were now relatively small in comparison to the coefficients. 
In summary, after transforming the data numerous times to 
eliminate all autocorrelation from the residuals, an EGLS 
regression model was obtained that yielded excellent, reli- 
able results. This model for the total overhead costs for 
contractor A was 

TOTOHA = 69090 + 12.18 DIRPERA. 

Since all costs were measured in thousands of dollars, the 
model may be interpreted as indicating that there is a fixed 
cost component of approximately $69,090,000 to overhead 
costs (when a function of direct labor personnel) with an 
additional $12,180 per direct labor personnel to total over- 
head costs (Roger, 1983, pp. 24-25). 

The results of the final EGLS models for contractor A 
using estimators four and nine are also contained in Table 
5. All required assumptions about the residuals in these 
models also appeared to be valid. All of the models yielded 
excellent results that were very comparable. The slope 
terms of all of the models were within eight one-hundredths, 
a spread of approximately three one -hundredths of one stan- 
dard deviation. The model for estimator three was slightly 
superior however. 
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This same general procedure was carried out for all of 
the remaining models , but only their final results are 
reported. The only difference in the procedure for the 
different models was in the order that the various autore- 
gressive processes were removed from the residuals. The 
normal sequence was to remove the AR processes by order of 
significance. To get an initial overview of the type of 
autocorrelation that was present in the residuals of the 
initial models, the autocorrelation functions of the resi- 
duals (Figure 5.5a and 5.5b) were examined. This, combined 
with the results of the Durbin-Watson and Wallis tests, gave 
the AR process that the data were to be adjusted for first. 
The remaining order was then determined from the results of 
the Durbin-Watson and Wallis tests performed on the 
residuals of the previous model. 

The autocorrelation functions showed that only the resi- 
duals for contractor A clearly appeared to possess the 
pattern expected to be exhibited by residuals containing 
Wallis's SAR(4) process. It was also clearly visible that 
the models for contractors C and D did not possess this 
SAR(4) process at a significant level. As with all of the 
contractors they will be examined more thoroughly later in 
this chapter. Despite the uncharacteristic appearance of 
their autocorrelation functions it appeared that Wallis’s 
SAR(4) process was also the most significant form of 
autocorrelation present in the initial models for the 
remaining contractors (B, E, and F). 

A number of factors led to this conclusion. First, the 
amount of data was very small relative to the amount 
required to obtain an accurate portrayal of the autoregres- 
sive process from the autocorrelation function. It should be 
noted that in the cases of contractors A, B, E and F the 
spike for the lagged four residuals was generally most 
significant. Secondly, the data were quarterly and as was 
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Figure 5.5a Autocorrelation Functions of the Residuals. 
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Figure 5.5b Autocorrelation Functions of the Residuals. 
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mentioned in chapter three, the plots of their dependent 
variables did appear to exhibit the seasonal pattern 
decribed by Wallis. Last, as will be shown, the Wallis test 
statistic was significant for the residuals from each of 
their initial models. 

C. THE REMAINING MODELS 
1 . Contractor B 

Table 6 contains the results for contractor B. The 
results of the initial regression of the untransformed data 
were fairly good. All of the statistical results, adjusted 
R-squared, standard error of regression, F-statistic, and 
standard errors of the regression coef ficients , supported 
this conclusion. But these statistical results could have 
been misleading due to the presence of autocorrelation. 
Analysis of the residuals showed the Wallis test statistic 
to be significant, indicating the presence of the SAR(4), 
while the Durbin-Watson test statistic was shown to be 
insignificant. The data were then transformed and the model 
reestimated for each of the three estimators of . Again 
all of the estimators were fairly close. These new models 
all provided excellent results. Analysis of the residuals 
showed no presence of any AR process, and none of the neces- 
sary assumptions pertaining to the residuals were violated 
in any of the models. Again the performance of all three 
models was very comparable, but the one obtained using 
estimator nine was slightly superior. It should be noted 
that all three models have negative intercepts. This implies 
negative fixed costs for those models. Though implausible, 
it is not totally infeasible. The models are being fit to 
data that are far from the Y-axis and, as with all models, 
these are only valid within the relevant range defined by 
the chosen explanatory variable. 
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TABLE 6 






RESULTS FOR CONTRACTOR B 




Model: TOTOHB = a + 


bDIRPERB 






Untransformed 


Data 


Standard Error of the Regression: 
Adjusted R-squared: 

F-statistic (degrees of freedom): 
Estimate of a: 

Standard Error: 

Estimate of b: 

Standard Error: 

Durbin-Watson Test Statistic: 
Wallis Test Statistic: 

Estimator Three: 

Estimator Four: 

Estimator Nine: 


9219. 

.7657 

69.61 

-144300. 

4450. 

27.98 

3.354 

2.206 

.75 

.625 

.6385 

.7256 


(1,20) 




Transformed Data 


Estimator Three 






Standard Error of Regression: 
Adjusted R-squared: 

F- statistic : 

Estimate of a: 

Standard Error: 

Estimate of b: 

Standard Error: 


7728. 

.9219 

248.7 

-110200. 

50980. 

25.41 

3.84 




Estimator Four 






Standard Error of Regression: 
Adjusted R-squared: 

F- statistic : 

Estimate of a: 

Standard Error: 

Estimate of b: 

Standard Error: 


7715. 

.9235 

254.3 

-109000. 

51260. 

25.32 

3.861 




Estimator Nine 






Standard Error of Regression: 
Adjusted R-squared: 

F- statistic : 

Estimate of a: 

Standard Error: 

Estimate of b: 

Standard Error: 


7651. 

.9287 

274.5 

-100500. 

53240. 

24.68 

4.006 
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2. Contractor C 



The results for contractor C are contained in Table 
7. Very poor results were obtained for the initial regres- 
sion. From the autocorrelation function of the residuals of 
this model (Figure 5.5a) it appeared that the AR(1) process 
was the most significant form of autocorrelation present. 
The Durbin-Watson test statistic supported this finding. It 
was significant, indicating the presence of first-order 
autocorrelation. As was expected, the Wallis test statistic 
was insignificant. The data were then transformed to 
eliminate the presence of the AR(1) process, and the model 
reestimated. Examination of the residuals of this model 
showed no autocorrelation present nor any required assump- 
tions violated. The final model had been obtained without 
requiring a data transformation for the SAR(4) process. The 
results of this model, though inferior to the previous two, 
were fairly good. 

3 . Contractor D 

Table 8 presents the results for contractor D. Again 
fairly poor results were obtained for the initial regres- 
sion. From the autocorrelation function of the residuals of 
this model (Figure 5.5b) it appeared that no AR process was 
present in any significant amount. As expected, the 
Durbin-Watson and Wallis test statistics were both insignif- 
icant. So the final model, though rather poor, had been 
obtained without requiring any data transformations. Further 
analysis of the residuals indicated that again all necessary 
assumptions appeared to hold. The model for contractor D's 
total overhead costs is probably unreliable. 

4 . Contractor E 

The results for contractor E are presented in Table 
9. Again very poor results were obtained for the initial 
model. The sequence of steps for contractor E were the same 
as that for contractor A. First, it was necessary for the 



64 



TABLE 7 






RESULTS FOR CONTRACTOR C 




Model: TOTOHC = a + bDIRPERC 






Untransformed Data 


Standard Error of the Regression: 
Adjusted R-squared: 

F-statistic (degrees of freedom): 
Estimate of a: 

Standard Error: 

Estimate of b: 

Standard Error: 

Durbin-Watson Test Statistic: 
Wallis Test Statistic: 

Estimator of 


11260. 

.2123 

7.2 

158100. 

12440. 

2.148 

.8004 

.7814 

1.524 

.6263 


(1,22) 




Transformed 


Data 


Estimator for p^^ 






Standard Error of Regression: 
Adjusted R-squared: 
F-statistic : 

Estimate of a: 

Standard Error: 

Estimate of b: 

Standard Error: 


9120. 

. 6891 
51.97 
149700. 
21420. 

2.699 

1.389 





data to be transformed to eliminate the presence of the 
SAR(4) process, and then the model was reestimated. As 
usual, a reestimated model was calculated for each of the 
three estimators of p^. Estimators three and four were again 
very close while estimator nine was significantly larger. 
Next it was necessary (in all three cases) for the data to 
be transformed to eliminate the AR(1) process from the resi- 
duals. The EGLS regression of this transformed data proved 
to be the final reestimation required. Analysis of these 
residuals showed no presence of autocorrelation nor viola- 
tion of any required assumptions. All three final models 
provided fairly good results and were again very comparable. 
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TABLE 8 

RESULTS FOR CONTRACTOR D 




Model: TOTOHD = a + 


bDIRPERD 






Untransformed 


Data 


Standard Error of the Regression: 


10240. 




Adjusted R-squared: 


.5291 


(1,22) 


F-statistic (degrees of freedom): 


26.84 


Estimate of a: 


5217 . 


Standard Error: 


18400. 




Estimate of b: 


14.88 




Standard Error: 


2.873 




Durbin-Watson Test Statistic: 


1.450 




Wallis Test Statistic: 


1.366 





The model for estimator nine was again slightly better, 
however . 

5 . Contractor F 

Table 10 presents the results for contractor F. This 
time the results for the OLS model were fairly good. But 
again, these statistical results could have been misleading 
due to the presence of autocorrelation. As with contractor E 
the data had to be transformed for the SAR(4) and then the 
AR(1) process, and the model reestimated before a final, 
"uncorrelated" model was obtained. Examination of these 
residuals showed that no type of autocorrelation was present 
and that all required assumptions appeared to hold. The 
results from all three final models were again very 
comparable, and provided very good results. The model for 
estimator three was slightly superior, however. 

D . SUMMARY 

Two things were performed in this chapter. First, 
regression models were obtained for each contractor that 
allowed for comparison of their total overhead costs. 
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TABLE 9 


RESULTS FOR CONTRACTOR E 




Model: TOTOHE = a + 


bDIRPERE 






Untransformed 


Data 


Standard Error of the Regression: 


3424. 




Adjusted R-squared: 


.2427 


(1,20) 


F-statistic (degrees of freedom): 


7.73 


Estimate of a: 


27390. 


Standard Error: 


8622. 




Estimate of b: 


6.549 




Standard Error: 


2.356 




Durbin-Watson Test Statistic: 


1.043 




Wallis Test Statistic: 


.8996 




Estimator Three: 


.5502 




Estimator Four: 


.5631 




Estimator Nine: 


.7262 






Transformed Data 


Estimator Three 


Standard Error of Regression: 


2897 . 




Adjusted R-squared: 


. 7951 




F-statistic : 


82.49 




Estimate of a: 


19450. 




Standard Error: 


12880. 




Estimate of b: 


8.802 




Standard Error: 


3.662 




Estimator Four 


Standard Error of Regression: 


2893. 




Adjusted R-squared: 


.7969 




F-statistic : 


83 . 39 




Estimate of a: 


19240. 




Standard Error: 


12790. 




Estimate of b: 


8.864 




Standard Error: 


3.637 




Estimator Nine 


Standard Error of Regression: 


2836. 




Adjusted R-squared: 


.8022 




F-statistic : 


86.15 




Estimate of a: 


18580. 




Standard Error: 


11820. 




Estimate of b: 


9.08 




Standard Error: 


3.358 
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TABLE 10 


RESULTS FOR CONTRACTOR F 




Model: TOTOHF = a + 


bDIRPERF 






Untransformed 


Data 


Standard Error of the Regression: 


17470. 




Adjusted R-squared: 


.6156 




F-statistic (degrees of freedom): 


37.84 


(1,22) 


Estimate of a: 


219600. 


Standard Error: 


24470. 




Estimate of b: 


10.14 




Standard Error: 


1.648 




Durbin-Watson Test Statistic: 


1.927 




Wallis Test Statistic: 


.4977 




Estimator Three: 


.7512 




Estimator Four: 


.7634 




Estimator Nine: 


. 7872 






Transformed Data 


Estimator Three 


Standard Error of Regression: 


9660. 




Adjusted R-squared: 


.9664 




F- statistic : 


663.3 




Estimate of a: 


229700. 




Standard Error: 


27490. 




Estimate of b: 


9.175 




Standard Error: 


1.847 




Estimator Four 


Standard Error of Regression: 


9660. 




Adjusted R-squared: 


.9664 




F-statistic : 


663.3 




Estimate of a: 


229700. 




Standard Error: 


27490. 




Estimate of b: 


9.175 




Standard Error: 


1.847 




Estimator Nine 


Standard Error of Regression: 


9641. 




Adjusted R-squared: 


.9638 




F-statistic : 


613.8 




Estimate of a: 


227600. 




Standard Error: 


28250. 




Estimate of b: 


9.3 




Standard Error: 


1.885 
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Second, since final models were calculated for each of the 
three estimators of p^, another means to compare the estima- 
tors was obtained. 

The analysis indicated that excellent structural 
results were obtained for all but one contractor, contractor 
D. All of the others yielded reliable, useful regression 
models. For comparative purposes the model that provided 
the best results for each contractor is presented: 



TOTOHA 

TOTOHB 

TOTOHC 

TOTOHD 

TOTOHE 

TOTOHF 



69090 + 12.18 DIRPERA 
-100500 + 24.68 DIRPERB 
149700 + 2.699 DIRPERC 

5217 + 14.88 DIRPERD 
18580 + 9.08 DIRPERE 

229700 + 9.175 DIRPERF. 



The model for contractor D was included in the comparison 
even though its results were unreliable. 

It is possible to use these models to compare the over- 
head cost structures of the firms in the sample. The model 
for contractor E lies everywhere below the regressions for 
contractors A and F. In each case contractor E had both a 
significantly lower fixed cost and a lower (not significant) 
variable cost than the other contractor. Likewise contractor 
C's model was uniformly below that of contractors F. In this 
case though, the differences in the fixed and variable costs 
were both statistically significant. It should be noted 
that the comparisons imply only that, with the same number 
of direct personnel, one contractor experiences lower total 
overhead costs than another. They do not imply that the 
contractor possesses lower total overhead costs regardless 
of the circumstances (Boger, 1983, p.33). 

For all relevant contractors the models for the three 
estimators were all very comparable. They were so comparable 
in fact, that no distinction could be drawn as to which 
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estimator was better. The superiority of estimators three 
and four over estimator nine in the Monte Carlo simulation 
was obviously not significant enough to manifest itself in 
this structural analysis. 
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VI. CONCLUSIONS 



The objectives of this thesis were twofold. The first 
objective was to determine the best estimators of autocorre- 
lation found in the residuals. The second was to obtain 
simple and efficient regression models for overhead costs 
for the six aerospace contractors. The first objective was 
simply a means of achieving the second since the quality of 
the EGLS regression is dependent upon the quality of the 
estimator used. 

Two methods were utilized to determine the best 
estimators of fourth-order autocorrelation, a Monte Carlo 
simulation and comparison of the EGLS regression models of 
the three preferred estimators (from the simulation). The 
results from the Monte Carlo simulation indicated estimators 
three and four, the two estimators that utilize the Wallis 
test statistic, to be the two best estimators of fourth- 
order autocorrelation. Their overall performances were 
nearly identical with preference determined by the value of 
p^. Estimator three was superior to estimator four for 
less than or equal to .5, while four was superior for p^ 
greater than .5. The maximum likelihood estimator, esti- 
mator nine, was shown to be the third best estimator. 
Another important result from the simulation was that all 
nine of the estimators provided an improvement over the OLS 
solution when the SAR(4) process was present. Comparison of 
the respective EGLS models showed no difference between the 
performances of the three estimators. The models were so 
comparable that no distinction could be drawn between the 
estimators . 

Due to their superior performances in the Monte Carlo 
simulation, estimators three and four were chosen as the 
best estimators of fourth-order autocorrelation. However, 
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it may have appeared from the structural analysis that esti- 
mator nine was their equal. The Monte Carlo simulation was 
much more sensitive to slight differences in the perform- 
ances of the estimators than the regression models. For 
this reason the simulation was the main criterion used in 
selecting the best estimator. Comparison of the regression 
models in the structural analysis would have been more 
useful if distinct disparities had arisen between the 
performances of the estimators. 

From this analysis no distinct preference could be made 
between estimators three and four. Both estimators are 
fairly easy to calculate once the Wallis test statistic has 
been computed, so difficulty of computation can not be used 
to determine a preference between the two. The only 
criterion found to discriminate between the two is the 
amount of fourth-order autocorrelation present in the resi- 
duals (the value of p^) and the "smoothness” of the the 
explanatory variables. If the value of (if it can be 
speculated) is less than or equal to .5 then estimator three 
would be expected to perform better, while estimator four 
would be preferred for p^ greater than .5. Though not tested 
herein, if the explanatory variables are "smooth" then esti- 
mator four would again be expected to outperform estimator 
three. Though not as scrupulously investigated in this 
thesis. Park and Mitchell's version of the Prais -Winsten 
estimator was selected as the estimator of first-order auto- 
correlation. It was selected because it was shown to be the 
superior two-stage estimator for first-order autocorrelation 
in Park and Mitchell’s 1980 study. 

The structural analysis showed that excellent results 
could be obtained for four of the six aerospace contractors. 
The results of a fifth model were also more than adequate. 
These excellent results were only obtainable after the 
effects of autocorrelation were transformed out of the 
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dependent and independent variables. As was the case in 
Boger's 1984 analysis, the five superior structural models 
should provide excellent forecasting results. It can ulti- 
mately be concluded that, after eliminating the effects of 
autocorrelation though transformation, a simple, efficient 
model can be obtained to directly estimate overhead costs 
for five of these six aerospace contractors. 
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APPENDIX 



COMPUTER PROGRAMS 

This appendix contains listings of the programs utilized 
in the analysis performed herein. All of the functions are 
written in APL and contain documentation. The programs 
utilized by the Monte Carlo simulation in Chapter 4 were 
SIM, RREGRESS, LAGS, DURBIN, CALLl , and TRANS. The following 
is a general description of what these programs do and the 
sequence of steps followed in the Monte Carlo simulation. 

First, the function SIM generates the data required by 
the simulation. Next, an OLS regression of the generated 
dependent variable, Y^. , on the independent variable, , is 
performed. The residuals of this OLS regression are then 
sent to the function LAGS which computes the nine estimators 
of and the MSE of p^. Next, the function TRANS transforms 
the data using the appropriate estimator. An EGLS regression 
is then performed on this transformed data. These last two 
steps are actually performed in the function CALLl which 
calls the functions TRANS and RREGRESS in succession. The 
function LAGS then takes the results of this EGLS regression 
and computes the MSE of j3£ and the Adjusted R- squared value. 
This cycle is replicated one hundred times for each value of 
p^ The function RREGRESS is called to perform all of the 
regressions required in the simulation. The functions 
DURBIN, PWIT and MAX are called to compute the Durbin, iter- 
ative Prais -Winsten , and maximum likelihood estimators of 
fourth-order correlation. 

The programs utilized in the structural analysis of 
Chapter 5 were REGRESS, TRANS, LAG, CHECKER, and VARTEST. 
All regressions were performed by REGRESS (Musgrave and 
Ramsey, 1981, pp . 254-258). As can be seen from the program 
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listing, REGRESS outputs numerous statistical results. The 
function TRANS was utilized to transform the data for either 
an AR(1) or SAR(4) process. The functions LAG and CHECKER 
were called to compute numerous estimators and test statis- 
tics (for first and fourth-order autocorrelation) from the 
residuals. The F-test utilized in the analysis was performed 
using the function VARTEST. 



APL FUNCTION SIM 
RHOS SIM REP 

Cl] f^THIS PROGRAM PERFORMS A MONTE CARLO SIMULATION ON 
C2] (^THE NINE ESTIMATORS OF pU. THE INPUT RHOS IS A 
C3] rVECTOR of pU VALUES THAT YOU WANT TO RUN THE 
[4] PiSIMULATION FOR, IN THIS CASE IT IS A VECTOR OF 
C5] (^LENGTH TEN OF THE VALUES: .1, .2, .3, .4, .5, .6, 

C6] P.7, .8, .9, AND .95. THE INPUT PARAMETER REP IS 

[7] f^THE NUMBER OF REPLICATIONS OF THE SIMULATION YOU 
C8] (^WISH TO RUN, IN THIS CASE 100. 

C9] PRINT*- 3 9 pO 
CIO] 0UT*r\ 

C 1 1 ] RMSERHO^AAR^EFF^RANKRHO^RANKAAR^RANKEFF 
■6-((p/?HOS),9)pO 

C12] L2:' PERFORMING TEST FOR RHO^ h ' ,^RHOSlOUTl 
C13] RHO^*rRHOS\.OUTl 

C14] (\CREATI0N OF NECESSARY VECTORS, 

Cl 5] MSERn^MSERUrS*rMSERm*rMSEP^*rMSEP^S-*MSEPLS*rMSED-*Q 
Cl6] MSEBOLS*rMSEBR^*rMSEBR'\S-*MSEBRm*rMSEBP^-*MSEBPnS-*MSEBPLS 
^ *-MSEBD*-0 

C17] AAR20LS*rAAR2Rn*rAAR2RnS-*AAR2RUL*-AAR2P^<-AAR2PiiS^AAR2PLS 

*rAAR2D*rO 

C 1 8 ] MSEPW^MSEPPML^MSEBPW<rMSEBPPML-<rAAR2PW^AAR2PPML*rREPPW 
*rREPMAX<rO 
Cl 9] PWITFAIL-*0 
C20] IN*rl 

C21] ^RANDOM GENERATION OF THE U{T), THEY ARE 
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[2 2] (^DISTRIBUTED NORMALLY WITH MEAN 0 AND VARIANCE 

[23] (^ACCORDING TO THE CONTRACTOR. 

[24] Ll:U-e-32 NORRAND 0 13454 

[25] (iSET ALL EiT) TO U (T)-i Ul- RH0n*2)* .5 ) 

[26] E-6-[/t((1-/?WO4*2)*0.5) 

[27] J-e-5 

[28] aSET THE LAST T-U E(DS TO 4 ) )+ UiT). 

[29] IT:ELIl<riRHOnxELI-^l)+ULn 

[30] I^I+l 

[31] ^jrxiJ<32 

[3 2] a SET B TO THE APPROPRIATE INTERCEPT AND SLOPE 
[3 3] f^FOR THE CONTRACTOR BEING TESTED. 

[34] B-e- 0 17 

[35] XCURRENT^ 1 , XGURRENT 

[36] ^GENERATE THE INDEPENDENT VARIABLE, Y 

[37] Y CURRENT-^ ( XCURRENT+ . x B ) +ET-<r~ 2 4 1 B 

[38] XCURRENT+ 24 “1 fXCURRENT 

[39] aPERFORM THE OLS REGRESSION OF PRESET 

[40] (^DEPENDENT VARIABLE, X, ON THE GENERATED 

[41] ^INDEPENDENT VARIABLE, Y. 

[42] Y CURRENT RREGRESS XCURRENT 

[43] MSEBOLS-<rMSEBOLS+i{BER\:2l-BL2l )*2 ) 

[44] AAR20LS-<rAAR20LS^ADJR2 

[4 5] (iCALL THE FUNCTION LAGS WITH THE RESIDUALS FROM 

[46] rTBB OLS REGRESSION AND THE VALUE OF BB04 . 

[47] BB04 LAGS UH 

[48] IN^IN+1 

[49] ^LlxiIiV<BBP 

[50] (^COMPUTATION OF MOE' S AND OUTPUT 

[51] MSERHO^MSERU , MSERUS , MSBP4 , MSEPnS , MSED , MSEPLS , MSERnL , 
MSEPW ,MSEPPML 

[52] RMSERHOLOUTil^(MSERHOTREP)*0.5 

[53] AR-^AAR2RU ,AAR2R^S ,AAR2P^ ,AAR2P^S ,AAR2D ,AAR2PLS , 
AAR2R^L ,AAR2PW ,AAR2PPML 

[54] AARZOUT;l^ARi-REP 
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[55] MS-<rMSEBRn , MSEBRUS , MSEBPn , MSEBP^^S , MSEBD , MSEBPLS , 
MSEBRUL , MSEBPW , MSEBPPML 

[56] RMSEBETA^(MSrREP)*0.5 

[57] EFFLOUT {MSEBOLS^REP ) * 0 . 5 ) ^RMSEBETA 

[58] PRINT [ 1 ; ] ^RMSEREO L0UT\1 

[59] PRINTl2\l-<rAARL0UT\'\ 

[60] PRINTiZil^EFFZOUTil 

[61] RANKRHO LOUT ; ] -^k^liJlMSERHO LOUT 
[6 2] RANKAARLOUTil^H^^AARLOUTxl 

[6 3] RANKEFFLOUTO^H:^EFFLOUTil 

[64] 'FOR RHO'^ H *,5i/?ff04 

[65] 'RMSE RHO, ADJUSTED RSQR AND EFF:' 

[66] '12345678 9' 

[67] PRINT 

[68] 'AVG ADJ R2 FOR OLS = ' ,^iAAR20LSirREP) 

[69] 'AVG REPS FOR PWIT = ' ,^iREPPW^REP) 

[70] 'PWIT FAILED TO CONVERGE = ' ,^PWITFAIL 

[71] 'AVG REPS FOR MAX b ' ,^(REPMAXtREP) 

[72] ' ' 

[73] OUT^OUT+1 

[74] ^L2xxOUT<pRHOS 

[75] 'RMSE RHO' 

[76] RMSERHO 

[77] ' ' 

[78] 'RANK RMSERHO' 

[79] RANKRHO 

[80] 'AVG RANK RMSERHO' 

[81] i+/RANKRHO)^pRHOS 

[82] ' ' 

[83] 'ADJUSTED R*2' 

[84] AAR 

[85] ' ' 

[86] 'RANK ADJUSTED R*2' 

[87] RANK AAR 

[88] 'AVG RANK ADJ R*2' 



79 



[89] (+/RANKAAR)rpRHOS 

[90] ' » 

[91] 'EFFICIENCY' 

[92] EFF 

[93] ' ' 

[94] ' RANK EFFICIENCY ' 

[95] RANKEFF 

[96] 'AVG RANK EFF. ' 

[9 7] (+;^RANKEFF')ipRHOS 



APL FUNCTION LAGS 
RE on LAGS XX 

[1] aUSED IN THE MONTE CARLO SIMULATION. IT IS CALLED 

[2] pro COMPUTE THE NINE ESTIMATORS OF pn AND TO 

[3] ^PERFORM THE EGLS REGRESSION AND CALCULATE 

[4] rxTHE MOES FOR EACH ESTIMATOR. THE INPUT RHOn IS THE 

[5] rxVALUE OF pn THAT THE SIMULATION IS BEING PERFORMED 

[6] fxFOR THE INPUT XX IS A VECTOR OF THE RESIDUALS. 

[7] 

[8] J’^pXX 

[9] 

[10] K-^J-n 

[ 11 ] AA<rEE^\K 

[12] P DETERMINE THE UN LAGGED AND LAGGED RESIDUALS. 

[13] LOOP:ID<-L-I 

[14] ^.^[J]^XX[JO] 

[15] E£[J]^XX[JZ)- 4] 

[16] J^I+1 

[17] -^LOOP^\I<K 

[18] r4^(pXX)-4 

[19] r5^r4-i 

[20] i^^+/XX*2 

[ 21 ] 42^+/((0,0,0,0, (r5pl),0)/XX)*2 
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C2 2] Z?4-e-(4-/l)x+/(((0,0,0,0,r4pl)/XX)- ( (2’4p 1) , 0 , 0 , 0 , 0 )/XX )*2 

[23] ^CALCULATE TEE SEVEN ESTIMATORS OF p4 

[24] F4>6-(4-A)x + /(((0,0,0,0,r4pl)/XX)x((2’4pl),0,0,0,0)/XX) 

[2 5] RnSTAR^{U-2)^Rn)TiJ-l) 

[26] F4L<-(TA2)x+/(((o,0,0,0,r4pl)/XX)x((r4pl),0,0,0,0)/XX) 

[27] P4<-1- (0 . 5xZ?4 ) 

[28] PnSTAR-^ii. iJ*2 )xP4 )+4 )4 ( (J*2 )- 4 ) 

[29] PLS^AA^EE 

[30] (\CALL THE FUNCTIONS DURBIN, PWIT , AND MAX TO 

[31] (^CALCULATE THE DURBIN, ITERATIVE PRAIS-WINSTEN , 

[3 2] (^AND MAXIMUM LIKELIHOOD ESTIMATORS 

[33] DURBIN 

[34] PNIT 

[3 5] -^MM^iiPWITF^O) 

[36] PNITFAIL^PWITFAIL-^1 

[37] ^FFx 1 iPWITF=l ) 

[38] MM\ REPPW<-REPPN+ ( NPW- 1 ) 

[39] FF:MAX 

[40] ^>lAxi (M>1XF=0 ) 

[41] AA: REPMAX^REPMAX+ (NML- 1 ) 

[42] (^CALCULATE THE MSE OF p4 FOR EACH ESTIMATOR 

[43] A?SFP4-6-MSFP4 + ((P4-PP04)*2) 

[44] MSFP4S-e-WSFP4S+( (P4SZ’71P-PP04)*2 ) 

[45] MSFP4L^M5FP4L+( (P4L-PP04)*2 ) 

[46] MSFP4-«-M5FP4+((P4-PP04)*2) 

[47] MSEP^S■^MSEP^S+ i (.PnSTAR- RHO^ )*2 ) 

[48] MSEPLS-^MSEPLS-^ ( {PLS- RHOU ) * 2 ) 

[49] MSED^MSED+ UPDURBIN- RHOn )*2 ) 

[50] (P/7JrP=l ) 

[51] MSEPN^MSEPW+ ( (.PH- RHOU )*2) 

[52] MS: MSEPPML-^MSEPPML + ( ( PPML- RHO 4 ) * 2 ) 

[53] ^PERFORM THE EGLS REGRESSION AND CALCULATE MSE BETA 

[54] r\AND ADJUSTED R- SQUARED FOR EACH OF THE ESTIMATORS 

[55] CALLl Rn 

[56] MSEBRn<rMSEBR^ + ((BERL2'\- B12'\ )*2 ) 
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C57] 


AAR2R^-<rAAR2R^^ADJR2 




C58] 


CALLl RnSTAR 




C59] 


MSF5P4S-«-WSF5P4S+( (SFPC2]- BC2] )* 


2) 


C60] 


AAR2R^S-<-AAR2RnS-^ADJR2 




C61] 


CALLl RnL 




C62] 


WSFSP4L>6-WSFSP4L+( (SFPC2]-PC2] )* 


2) 


C63] 


AAR2R^L^AAR2R^L-^ADJR2 




C64] 


CALLl P4 




C65] 


MSEBPU^MSEBPn + UBERL2l-BL2l )*2) 




C66] 


AAR2Pn^AAR2Pn+ADJR2 




C67] 


CALLl PnSTAR 




C68] 


MSEBPUS-^MSEBPnS+((.BERL2l- BL21 )* 


2) 


C69] 


AAR2PnS^AAR2PnS+ADJR2 




C70] 


CALLl PLS 




C71] 


MSEBPLS^MSEBPLS+i (SFPC2]-SC2] )* 


2) 


C72] 


AAR2PLS^AAR2PLS+ADJR2 




C73] 


CALLl PDURBIN 




C74] 


MSFSP^MSFSP+((BFPC2]-BC2] )*2) 




C75] 


AAR2D^AAR2D-^ADJR2 




C76] 


■^CCy-\iPWITF=l ) 




C77] 


CALLl PW 




C78] 


MSEBPW^MSEBPW+ ( ( BER C2]-BC2])*2) 




C79] 


AAR2PW^AAR2PW+ADJR2 




C80] 


CC: CALLl PPML 




C81] 


MSEBPPML^MSEBPPML +((BFPC2]-BC2])* 


C82] 


AAR2PPML^AAR2PPML+ADJR2 





APL FUNCTION TRANS 
P TRANS V 

Cl] f^THIS FUNCTION TRANSFORMS THE RAW DATA FOR EITHER 

[2] (^AN AR{1) OR AN SAR{^) PROCESS. THE INPUT P IS A 

[3] (^VECTOR OF LENGTH 2 WHERE P[l] IS THE ESTIMATE OF pi 

[4] f\AND PC 2] IS THE ESTIMATE OF p4. THE INPUT V IS THE 
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[5] f\VARIABLE YOU WANT TO BE TRANSFORMED. V CAN BE 

[6] rEITHER a vector OF LENGTH N OR A Ny~l MATRIX. 

[7] CZW^pp7 

[8] ^VECTOR^\DIM=\ 

C9] rRE SHAPE V INTO A VECTOR IF IT WAS ENTERED 
CIO] rAS a N^l MATRIX 

[11] CHECK^pV 

[12] CHECKl^CHECKLll 

[13] V^CHECKlpV 

[14] VECTORlPl^PLll 

[15] P2^PL21 

[16] ■^NEXTx\P2>0 

[17] rloop to be performed for the ar{i) transformation 

[18] Jl^(p7)-1 

[19] 71-^(0,Jlpl )/7 

[20] 7P1^( (Jlpl ),0)/7 

[21] 771^7[l]x((l- (P1*2))*0.5) 

[22] 772-e-71- (P1X7P1 ) 

[23] 77-e-771,772 

[24] -^ENDy-iDIM^l 

[25] rRESHAPE the VARIABLE INTO A N^l MATRIX IF IT 

[26] rWAS entered AS SUCH 

[27] 77<-(CPPC/^l ,l)p77 

[28] -^END 

[29] rLOOP to be performed FOR THE AR{n) TRANSFORMATION 

[30] iVPXT: J4-«-(p7)-4 

[31] 71^(0, 0,0,0,J4pl)/7 

[32] 7P4^((j4pl),0,0,0,0)/7 

[33] rCHECK if the ESTIMATOR IS < 1.0 AND IF NOT: 

[34] ^0KxiP2<l 

[35] P 1) SET THE ESTIMATOR TO .99999 

[36] P2-^0. 99999 

[37] nSAP(4) TRANSFORMATION COMPUTATION 

[38] 0X:7P1234^4+7 

[3 9] 771234-^7Pl2 34x((i- (P2*2))*0.5) 
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[40] 775-S-71- (P2x7P4) 

[41] 77^771234,775 

[42] -^ENDxixDIMBl 

[4 3] fxRE SHAPE THE VARIABLE INTO A Nxi MATRIX IF IT 

[44] f^WAS ENTERED AS SUCH 

[45] VV^{CHECKl,l)pVV 

[46] END'. 



APL FUNCTION DURBIN 
DURBIN 

[1] rPPJS function calculates the DURBIN ESTIMATOR OP p4. 

[2] DIMD^pYCURRENT 

[3] TnD^DIMD-n 

[ 4 ] Y DURBIN^ ( 0 , 0 , 0 , 0 , P4Pp 1 ) /YCURRENT 

[5] XDURBIN-<r{TnD ,^)p0 

[6] XPPPPJiV[ ;1]^1 

[7] XDURBINl ; 2]f- ( (P4Dp 1 ) , 0 , 0 , 0 , 0 )/YCPPPP/72’ 

[8] XCT<rDIMDpXCURRENT 

[9] XD£/PBJA?[ ;3]-e-(0,0 ,0,0 ,P4Ppl )/XCP 

[10] XDURBINl\nl^~ly.{ ( (P4Ppl ) ,0 ,0 ,0 ,0 )/XCT) 

[11] COE<rYDURBINmDURBIN 

[12] P5-e-COP[2] 



APL FUNCTION PWIT 
PWIT 

[1] f\THIS FUNCTION COMPUTES THE ITERATIVE PRAIS- WINSTEN 

[2] (^ESTIMATOR. 

[3] TP^pYCURRENT 

[4] TPN-<rTP- 4 

[5] CHE^l 

[6] PWITF^NPW^O 
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[7] XPW<-XCURRENT 

[8] YPW^YCURRENT 

[9] )pl 

[10] ITER : UHPW-^ ( Y CURRENT- ( YHPW^ ( 1 , XCURRENT ) + . x ( BEPW^rYPWE 

UPW-^(XPWT1 ,XPW ) ) ) ) ) 

[11] DEN^+/ UO ,0 ,0,0 ,((.TPN-l)pl) ,0)/UHPW)*2 

[12] PW^(^DEN)^ + / (.({0 ,0 ,0 ,0 ,TPNpl)/UHPW)^(.(.TPWpl) ,0 ,0 ,0 ,0) 
/UHPW) 

[13] P^0,PA^ 

[14] a TRANSFORM X AND Y 

[15] P TRANS XCURRENT 

[16] XPW^VV 

[17] P TRANS Y CURRENT 

[18] YPW^VV 

[19] RJP P>1 SET TO .99999 

[20] ■^CONPW^i(.PW<l) 

[21] PA^-t-0. 99999 

[22] CONPN '.XPNTl^XPNl^ { {TP ,l)p (4p ( (1-PA^*2 )*0 . 5 ) ) , ( {TP- 4 ) 

p{l-PW))) 

[23] NPW^NPN-^1 

[24] ■^DDy-\{NPW<52) 

[2 5] PWITF<-1 

[26] ^0 

[27] DD:DELTA<-CHE-PN 
[2 8] CRE-^PN 

[29] PtPERFORM ANOTHER ITERATION IF THE ABSOLUTE 

[30] (^DIFFERENCE BETWEEN THE LAST TWO ESTIMATIONS 

[31] RJS > .00001. 

[32] ( |PEL2’A)>0.00001 



APL FUNCTION MAX 
MAX 

[1] r^THIS FUNCTION COMPUTES THE MAXIMUMLIKELIHOOD 
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C2] (^ESTIMATOR OF p4 USING TEE ALGORITHM DERIVED 

[3] pSY beach and MACKINNON. 

[4] NPML^pYCURRENT 

[5] CH^2 

[6] TUrML<rNPML- n 

[7] XML^XCURRENT 

[8] YML^YCURRENT 

[9] XMLTl^XMLl^iNPML,l)pl 

[10] MAXF<-NML<-0 

[11] qcompute the residuals for the regression of X on Y. 

[12] ML: UH^ ( Y CURRENT- ( YHML^ ( 1 , XCURRENT ) + . x ( BEML^YML @ ( XML 

^UMLTl ,XML )))')') 

[13] .44^(1 ,1 ,1 ,1 , (Y4MLpO ) )/£/F 

[14] ATn^iiTUMLpl) ,0 ,0 , 0 ,0)/UH 

[15] AT^iO , 0 , 0 , 0 ,TnMLpl)/UH 

[16] DENOM^iNPML-DxU+Z (ATU*2))- i+ / iAH*2) ) ) 

[17] (^COMPUTE THE COEFFICIENTS OF THE POLYNOMIAL 

[18] AML^Cl^iNPML- 2 )x (+/ (42’xAY4 ) ) UDENOM 

[19] BML^i ( (NPML- l)x ( + / (714*2 ) ) )- ( (.NPML^ (+/ATU*2 ) ) + 
i+/AT*2 ) ) )iDENOM 

[20] CML^(NPML^i + /AT^ATn)UDENOM 

[21] PMLP<-BML- ( (AML* 2 )^3 ) 

[22] QML*-CML+ (2>^ (AML*3 ) t27 )- (AMLxBMLt3) 

[23] PFJl*-( (QML^ (27*0 . 5 ) U (2xPMLPx ( (~ lxPMLP)*0 . 5) ) ) 

[24] FHJ^"2oPHJl 

[25] Q COMPUTE THE ESTIMATOR 

[26] PPWL*-(~2x( ( (~lxpWLP)4 3 )*0 . 5 )x (2o ( (PPJtS ) + o(t3 ) ) ) )- 
(AML^3 ) 

[27] PPPML<-0 ,PPML 

[28] pCALL the function TRANS TO TRANSFORM THE RAW DATA 

[29] PPPML TRANS XCURRENT 
[3 0] XML<-VV 

[31] PPPML TRANS Y CURRENT 

[32] YML^VV 

[33] pJF P>1 RESET TO .99999 
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[34] ■*COmAX^\iPPML<l) 

[3 5] PPML-e-0. 99999 

[36] C0NMAX:XMLT1^XML1^{ {NPML , 1 )p (4p ( (1- PPML*2 )*0 . 5 ) ) , 

( {NPML- 4 )p ( 1- PPML ) ) ) 

[37] NML^NML+1 

[38] 1 52 ) 

[39] MAXF^l 

[40] ^0 

[41] MAP xDELT^CH- PPML 

[42] CH^PPML 

[43] RJP TEE ABSOLUTE DIFFERENCE BETWEEN TEE LAST TWO 

[44] (^ESTIMATES IS > .00001 TEEN PERFORM ANOTEER 
[4 5] (\ ITERATION 

[46] -^WLxi(|PPL2’)>0.00001 



APL FUNCTION CALLl 
CAL LI EST 



[1] 


qteis function calls tee transformation 


FUNCTION 


[2] 


f^TRANS FOR AN ESTIMATOR OF p4 AND TEEN ( 


CALLS TEE 


[3] 


f^FUNCTION RREGRESS AND PERFORMS AN EGLS 


REGRESSION 


[4] 


roa7 tee transformed data. 




[5] 


PEST-(-0 ,EST 




[6] 


PEST TRANS XCURRENT 




[7] 


XT^VV 




[8] 


PEST TRANS YCURRENT 




[9] 


YT<rVV 




[10] 


YT RREGRESS XT 





APL FUNCTION RREGRESS 
Y RREGRESS X;MSiSS 

[1] PtTEIS IS A CONDENSED VERSION OF REGRESS LESS 
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C2] (\TEE PRINTED OUTPUT TO BE USED IN CONJUOTION WITH 
C3] (^THE SIMULATION SIM. 

[4] NP<-pX 

[5] K^NPZll 

[6] S5^3pO 

[7] MS^U+/Y) ,MS^+/X)-^NPL11 

[8] SS[2]^(SS[1]^+/Y*2 )- A^PCl] xAJSCi]*2 

[9] UH^iY- {YH^X+.^iBER-rY^iX^Zl ,X))))) 

[10] SSC3]^+/£/ff*2 

[11] P2^1-SS[3] tSS[2] 

[12] NMINUSK^NP [ 1 ] - (if+1) 

[13] ADJR2^R2- UK^NMINUSK)^ U- R2)) 



APL FUNCTION REGRESS 

Y REGRESS XtMStD;SS 

[1] ft THIS FUNCTION PERFORMS AN OLS REGRESSION OF X ON Y . 

[2] ftTHE INPUT X IS THE INDEPENDENT VARIABLE AN IS 

[3] ft ASSUMED TO BE AN N^K MATRIX. THE INPUT Y IS THE 

[4] ft DEPEN DENT VARIABLE AND IS ASSULMED TO BE A VECTOR 

[5] ftOF LENGTH N. A DIAGNOSTIC IS PRINTED IF N<K OR IF 

[6] ft RANK X<K. THE CONSTANT TERM IS ADDED BY THE PROGRAM. 

[7] ftTHIS FUNCTION WAS OBTAINED FROM RAMSEY AND 

[8] ftMUSGRAVE'S PL- ST T {SEE REFERENCES). 

[9] FLAG<-1 

[10] NP^pX 

[ 11 ] K-<rNPL2) 

[12] SS^SpO 

[13] ^PREMENDl^\(NPll)<NPL2l+l) 

[14] CM^(.MM^W) + .^X)~ (MSo .^MS^+/X)iNPLl) 

[15] CRM^D+.yCM+.xD^^UilK)^ . = \K)xCM)*0.5 

[16] MS'«-((+/y),MS)TiVP[l] 

[17] SS[2]-«-(SS[l]'e-+/y*2 )-iVP[l]xWS[l]*2 

1; X 

[18] MAIN:UH^{Y- {YH^X+.^(.BE<-Y%iX^(.XCONl,X))))) 
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C19] SSLnl^(SSL32^+/YH*2 )- NPZllxMSLll*2 

[20] SSC5 3-s-+/yfl*2 

[21] RSQ^1-SSL51tSSL22 

[22] NMINUSK-(-NPLll- iK+1) 

[23] ADJRSQ^RSQ- i (K^NMINUSK)^ il- RSQ) ) 

[24] VARU-(rSSL5l^(- /NP)- 1 

[25] STDERR^VARU* 0 . 5 

[26] Fl^SSmii-SSLBlxKTi- /NP)-1 

[27] f2-s-SS[3]rSS[5]x(i^+l)^(- /NP)- 1 

[28] COVBE^VARU^^W^ + .xX 

[29] STDBE^il 1 ^COVBE)*0 .5 

[30] TRATIO^BEiSTDBE 

[31] ^CONT 

C-r (' 

[3 2] PREMEND :FLAG-(rO 

[33] CONTl^MA INEND xxFLAG 

[34] 'ROUTINE ENDED DUE TO SINGULARITY OF X MATRIX' 

[35] ^0 

[36] MAINENDl'THE COEFFICIENT ESTIMATES ARE, CONST., XI 

[37] BE 

[38] 'THE CORRESPONDING STANDARD ERRORS ARE.' 

[39] STDBE 

[40] 'THE CORRESPONDING T RATIOS AREl' 

[41] TRATIO 

[42] 'WITH DEGREES OF FREEDOM:' 

[43] NMINUSK 

[44] 'RSQ IS: ' ,^RSQ 

[45] 'ADJUSTED RSQ IS: ' ,9 AD J RSQ 

[46] 'STANDARD ERROR OF REGRESSION IS: ' ,^STDERR 

[47] 'VAR OF ERROR TERM IS: ' ,^VARU 

[48] 'THE F STATISTIC INCLUDING THE CONSTANT TERM IS: 

[49] F2 

[50] 'WITH DEGREES OF FREEDOM: ' 

[51] {K-\-D ,n-^- /NP) 

[52] 'THE F STATISTIC NOT INCLUDING CONSTANT TERM IS: 

[53] FI 
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C54] 'WITH DEGREES OF FREEDOM.' 

[55] 

[56] 'THIS ENDS THE OUTPUT FROM REGRESS.' 

[57] ^0 

[58] PREMENDl: 'NO. OF OBS (iV) IS TOO FEN RELATIVE THE ' 

[59] 'NO. OF REGRESSORS {K).' 

[60] 'ROUTINE TERMINATED' 



APL FUNCTION LAG 
LAG XX 

[1] qgiven the residuals this function calculates the 

[2] f^DURBIN- WATSON AND WALLIS TEST STATISTICS, THE 

[3] SINGLE ESTIMATOR OF pi , AND THREE PREFERED 

[4] f\ESTIMAT0RS Of p4 {FROM THE SIMULATION) . THE INPUT 

[5] f^XX IS A VECTOR OF LI NOTH N OF THE REGRESSION 

[6] r\RESIDUALS. 

[7] J^pXX 

[8] ri-e-J-1 

[9] T2-(rJ-2 

[10] I’4-«-J-4 

[11] il-e- + /XX*2 

[12] .ll-«-+/( (0 , (r2pl ) ,0)/XX)*2 

[13] ^CALCULATE THE TWO TEST STATISTICS 

[14] £ll-e-(x>nx+/(((0,2’lpl)/XX)- ((2’lpl),0)/XX)*2 

[15] f4-e-(T^ )x+/( ( (0 ,0 ,0 ,0 ,Z’4pl)/XX)- ( (r4pl ) , 0 , 0 , 0 , 0 ) /XX )* 2 

[16] ^CALCULATE THE ESTIMATOR OF pi 

[17] pp<r{^Ai )x+/(( (o.ripi )/xx)x( (ripi) ,0 )/xx) 

[18] (^CALCULATE THE THREE ESTIMATORS OF p4 

[19] ^ESTIMATOR NUMBER THREE IS 

[20] P3-6-1- (0 . 5xP4 ) 

[21] (^ESTIMATOR NUMBER FOUR IS 

[22] P4-e-(((J*2)xP3)+4)T((J*2)-4) 

[23] f^ESTIMATOR NUMBER NINE IS THE MAXIMUM LIKELIHOOD 
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C24] 


(^ESTIMATOR 




C25] 


MAX 




C26] 


'D1 = ' ,^D1 




C27] 


' DEGREES OF FREEDOM h ' ,<iTl 




C28] 


'PRAIS-WINSTEN 1ST ORDER ESTIMATE, 


PI B ' , <5PP 


C29] 


'Dn = ' ,$Z?4 




C30] 


' DEGREES OF FREEDOM b ' , $r4 




C31] 


' DURBIN- WATSON nTH ORDER ESTIMATE, 


P3 B ' , $P3 


C32] 


'D-W THEIL-NAGAR MOD. ESTIMATE, P4 


= ' ,$P4 


C33] 


'THE MAX LIKE ESTIMATOR IS , P9 b ' 


, !^PPML 



APL FUNCTION CHECKER 
CHECKER REEL 

Cl] ^THIS FUNCTION COMPUTES THE DURBIN- WATSON AND 
C2] qW ALLIS TEST STATISTICS ON THE RESIDUALS OF A 

[3] (^REGRESSION. THE INPUT RESL IS A VECTOR OF LENGTH 

[4] (^N OF RESIDUALS. 

C5] (^DETERMINE DEGREES OF FREEDOM 
C6] TCl^ipRESD-l 
C7] TCn^ipRESD-U- 

[8] AC^+/RESL*2 

[9] (^DETERMINE TEST STATISTICS 

CIO] DlC^iTAC)x + /iiiO ,TClpl)/RESL)- ( iTClpl) ,0) /RESL)*2 
Cll] DnC^ii-AC)^ + / iiiO , 0 , 0 ,0 ,TCnpl)/RESD- ( (rC4p 1 ) , 0 , 0 , 0 , 0 ) 
/RESL) *2 
C12] ' ' 

C13] 'AFTER TRANSFORMATION' 

C14] »D1 = ' ,^D1C 

CIS] 'DEGREES OF FREEDOM s ' ,(5TC\ 

C16] 'D4 = ' ,$D4C 

C17] 'DEGREES OF FREEDOM = ' ,$2’C4 
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APL FUNCTION VARTEST 



ALPHA VARTEST S 

[1] aTHIS FUNCTION PERFORMS THE F- TEST TO CHECK THE 
C2] (^HOMOGENEITY OF THE VARIANCE OF THE RESIDUALS, 

C3] WO^O.SxpS 

[4] SA-<rSB^NO^Q 

[ 5 ] DOF^NO- 1 

[ 6 ] D-^2pDOF 

C7] {^SERARATE THE RESIDUALS INTO TNO GROUPS, 

[ 8 ] SA^NO^S 

[9] SB-^NO-i'S 

CIO] ABAR^(-¥/SA)tNO 
Cll] BBAR^(+/SB)^NO 

[12] (^CALCULATE THE VARIANCES OF THE TWO GROUPS, 

[13] ASOS^+/iSA-ABAR)*2 

[14] BSOS^+/ (SB-BBAR)*2 

[15] (^CALCULATE THE TEST STATISTIC R AND THE CRITICAL 

[16] (^REGION, 

[17] R^{ASOS[BSOS)i{ASOSlBSOS) 

[18] Kl^D FQUANiALPHA^2) ■ 

[19] K2^D FQUAN{1- {ALPHA^2)) 

[20] (^DETERMINE THE LEVEL AT WHICH THE TEST STATISTIC IS 

[21] (^SIGNIFICANT, 

[22] ALPHAC^2x(l- (D FCENT R)) 

[23] 'FOR AN F TEST WITH ALPHA = ' ALPHA 

[24] .(^DETERMINE IF THE TEST STATISTIC R FALLS WITHIN 
[2 5] f\THE CRITICAL REGION, 

[26] ■*A^\(.iR>Kl)hiR<K2)) 

[27] 'REJECT Ho: VARl = VAR2' 

[28] -^NEXT 

[29] Al'ACCEPT Ho: VARl = VAR2' 

[30] NEXT:'F STATISTIC = ' ,9R 

[31] 'OBSERVED LEVEL OF SIGNIF, IS ALPHA s ' .9ALPHAC 
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Regres^on models of 
qua^-efly overhead 
:s for six govern- 
ment aerspace contrac- 
tors . 



